Showing 1–12 of 12 results for author: Peshterliev, S

  1. arXiv:2306.07499  [pdf, other]

    cs.CL cs.AI cs.LG

    Improving Opinion-based Question Answering Systems Through Label Error Detection and Overwrite

    Authors: Xiao Yang, Ahmed K. Mohamed, Shashank Jain, Stanislav Peshterliev, Debojeet Chatterjee, Hanwen Zha, Nikita Bhalla, Gagan Aneja, Pranab Mohanty

    Abstract: Label error is a ubiquitous problem in annotated data. Large amounts of label error substantially degrade the quality of deep learning models. Existing methods to tackle the label error problem largely focus on the classification task, and either rely on task specific architecture or require non-trivial additional computations, which is undesirable or even unattainable for industry usage. In this…

    Submitted 12 June, 2023; originally announced June 2023.

  2. arXiv:2110.06918  [pdf, other]

    cs.CL cs.IR cs.LG

    Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?

    Authors: Xilun Chen, Kushal Lakhotia, Barlas Oğuz, Anchit Gupta, Patrick Lewis, Stan Peshterliev, Yashar Mehdad, Sonal Gupta, Wen-tau Yih

    Abstract: Despite their recent popularity and well-known advantages, dense retrievers still lag behind sparse methods such as BM25 in their ability to reliably match salient phrases and rare entities in the query and to generalize to out-of-domain data. It has been argued that this is an inherent limitation of dense models. We rebut this claim by introducing the Salient Phrase Aware Retriever (SPAR), a dens…

    Submitted 11 November, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

  3. arXiv:2108.02765  [pdf, other]

    cs.CL

    Decoupled Transformer for Scalable Inference in Open-domain Question Answering

    Authors: Haytham ElFadeel, Stan Peshterliev

    Abstract: Large transformer models, such as BERT, achieve state-of-the-art results in machine reading comprehension (MRC) for open-domain question answering (QA). However, transformers have a high computational cost for inference which makes them hard to apply to online QA systems for applications like voice assistants. To reduce computational cost and latency, we propose decoupling the transformer MRC mode…

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: RANLP 2021

  4. arXiv:2103.08809  [pdf, other]

    cs.CL

    Robustly Optimized and Distilled Training for Natural Language Understanding

    Authors: Haytham ElFadeel, Stan Peshterliev

    Abstract: In this paper, we explore multi-task learning (MTL) as a second pretraining step to learn enhanced universal language representation for transformer language models. We use the MTL enhanced representation across several natural language understanding tasks to improve performance and generalization. Moreover, we incorporate knowledge distillation (KD) in MTL to further boost performance and devise…

    Submitted 15 March, 2021; originally announced March 2021.

  5. arXiv:2103.06500  [pdf, other]

    cs.CL

    Conversational Answer Generation and Factuality for Reading Comprehension Question-Answering

    Authors: Stan Peshterliev, Barlas Oguz, Debojeet Chatterjee, Hakan Inan, Vikas Bhardwaj

    Abstract: Question answering (QA) is an important use case on voice assistants. A popular approach to QA is extractive reading comprehension (RC) which finds an answer span in a text passage. However, extractive answers are often unnatural in a conversational context which results in suboptimal user experience. In this work, we investigate conversational answer generation for QA. We propose AnswerBART, an e…

    Submitted 11 March, 2021; originally announced March 2021.

  6. arXiv:2101.00133  [pdf, other]

    cs.CL cs.AI

    NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

    Authors: Sewon Min, Jordan Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini, et al. (28 additional authors not shown)

    Abstract: We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain question answering (QA), where systems take natural language questions as input and return natural language answers. The aim of the competition was to build systems that can predict correct answers while also satisfying strict on-disk memory budgets. These memory budgets were designed to encourage conte…

    Submitted 19 September, 2021; v1 submitted 31 December, 2020; originally announced January 2021.

    Comments: 26 pages; Published in Proceedings of Machine Learning Research (PMLR), NeurIPS 2020 Competition and Demonstration Track

  7. arXiv:2012.14610  [pdf, other]

    cs.CL

    UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering

    Authors: Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Schlichtkrull, Sonal Gupta, Yashar Mehdad, Scott Yih

    Abstract: We study open-domain question answering with structured, unstructured and semi-structured knowledge sources, including text, tables, lists and knowledge bases. Departing from prior work, we propose a unifying approach that homogenizes all sources by reducing them to text and applies the retriever-reader model which has so far been limited to text sources only. Our approach greatly improves the res…

    Submitted 3 May, 2022; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: NAACL-HLT 2022 Findings

  8. arXiv:2004.04060  [pdf, other]

    cs.CL

    Self-Attention Gazetteer Embeddings for Named-Entity Recognition

    Authors: Stanislav Peshterliev, Christophe Dupuy, Imre Kiss

    Abstract: Recent attempts to ingest external knowledge into neural models for named-entity recognition (NER) have exhibited mixed results. In this work, we present GazSelfAttn, a novel gazetteer embedding approach that uses self-attention and match span encoding to build enhanced gazetteer embeddings. In addition, we demonstrate how to build gazetteer resources from the open source Wikidata knowledge base.…

    Submitted 18 April, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: Preprint

  9. arXiv:1902.10649  [pdf, other]

    cs.CL

    F10-SGD: Fast Training of Elastic-net Linear Models for Text Classification and Named-entity Recognition

    Authors: Stanislav Peshterliev, Alexander Hsieh, Imre Kiss

    Abstract: Voice-assistants text classification and named-entity recognition (NER) models are trained on millions of example utterances. Because of the large datasets, long training time is one of the bottlenecks for releasing improved models. In this work, we develop F10-SGD, a fast optimizer for text classification and NER elastic-net linear models. On internal datasets, F10-SGD provides 4x reduction in tr…

    Submitted 27 February, 2019; originally announced February 2019.

  10. arXiv:1810.03450  [pdf, ps, other]

    cs.CL cs.AI

    Active Learning for New Domains in Natural Language Understanding

    Authors: Stanislav Peshterliev, John Kearney, Abhyuday Jagannatha, Imre Kiss, Spyros Matsoukas

    Abstract: We explore active learning (AL) for improving the accuracy of new domains in a natural language understanding (NLU) system. We propose an algorithm called Majority-CRF that uses an ensemble of classification models to guide the selection of relevant utterances, as well as a sequence labeling model to help prioritize informative examples. Experiments with three domains show that Majority-CRF achiev…

    Submitted 30 March, 2019; v1 submitted 3 October, 2018; originally announced October 2018.

    Comments: NAACL 2019

  11. arXiv:1807.07520  [pdf, ps, other]

    cs.CL

    Statistical Model Compression for Small-Footprint Natural Language Understanding

    Authors: Grant P. Strimel, Kanthashree Mysore Sathyendra, Stanislav Peshterliev

    Abstract: In this paper we investigate statistical model compression applied to natural language understanding (NLU) models. Small-footprint NLU models are important for enabling offline systems on hardware restricted devices, and for decreasing on-demand model loading latency in cloud-based systems. To compress NLU models, we present two main techniques, parameter quantization and perfect feature hashing.…

    Submitted 19 July, 2018; originally announced July 2018.

    Comments: Interspeech 2018

  12. arXiv:1711.00549  [pdf, other]

    cs.CL cs.AI cs.NE cs.SE

    Just ASK: Building an Architecture for Extensible Self-Service Spoken Language Understanding

    Authors: Anjishnu Kumar, Arpit Gupta, Julian Chan, Sam Tucker, Bjorn Hoffmeister, Markus Dreyer, Stanislav Peshterliev, Ankur Gandhe, Denis Filiminov, Ariya Rastrow, Christian Monson, Agnika Kumar

    Abstract: This paper presents the design of the machine learning architecture that underlies the Alexa Skills Kit (ASK), a large scale Spoken Language Understanding (SLU) Software Development Kit (SDK) that enables developers to extend the capabilities of Amazon's virtual assistant, Alexa. At Amazon, the infrastructure powers over 25,000 skills deployed through the ASK, as well as AWS's Amazon Lex SLU Servic…

    Submitted 2 March, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: Published at the 1st Workshop on Conversational AI at NIPS 2017 (NIPS-WCAI)

    MSC Class: 68T50