Skip to main content

Showing 1–18 of 18 results for author: Vandyke, D

  1. arXiv:2402.10137  [pdf, other

    cs.CL

    TOAD: Task-Oriented Automatic Dialogs with Diverse Response Styles

    Authors: Yinhong Liu, Yimai Fang, David Vandyke, Nigel Collier

    Abstract: In light of recent advances in large language models (LLMs), the expectations for the next generation of virtual assistants include enhanced naturalness and adaptability across diverse usage scenarios. However, the creation of high-quality annotated data for Task-Oriented Dialog (TOD) is recognized to be slow and costly. To address these challenges, we introduce Task-Oriented Automatic Dialogs (TO… ▽ More

    Submitted 6 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024

  2. arXiv:2212.01757  [pdf, other

    cs.CL cs.AI cs.LG

    Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer

    Authors: Benjamin Muller, Deepanshu Gupta, Siddharth Patwardhan, Jean-Philippe Fauconnier, David Vandyke, Sachin Agarwal

    Abstract: Multi-lingual language models (LM), such as mBERT, XLM-R, mT5, mBART, have been remarkably successful in enabling natural language tasks in low-resource languages through cross-lingual transfer from high-resource ones. In this work, we try to better understand how such models, specifically mT5, transfer *any* linguistic and semantic knowledge across languages, even though no explicit cross-lingual… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: In NeurIPS Workshop on Transfer Learning for Natural Language Processing, 2022, New Orleans. 15 pages, 8 figures, 5 tables

    MSC Class: 68T07 ACM Class: I.2.7; I.2.6

  3. arXiv:2209.11068  [pdf, other

    cs.CL

    Prompting for a conversation: How to control a dialog model?

    Authors: Josef Valvoda, Yimai Fang, David Vandyke

    Abstract: Dialog modelling faces a difficult trade-off. Models are trained on a large amount of text, yet their responses need to be limited to a desired scope and style of a dialog agent. Because the datasets used to achieve the former contain language that is not compatible with the latter, pre-trained dialog models are fine-tuned on smaller curated datasets. However, the fine-tuning process robs them of… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  4. arXiv:2108.13740  [pdf, other

    cs.CL

    Plan-then-Generate: Controlled Data-to-Text Generation via Planning

    Authors: Yixuan Su, David Vandyke, Sihui Wang, Yimai Fang, Nigel Collier

    Abstract: Recent developments in neural networks have led to the advance in data-to-text generation. However, the lack of ability of neural models to control the structure of generated output can be limiting in certain real-world applications. In this study, we propose a novel Plan-then-Generate (PlanGen) framework to improve the controllability of neural data-to-text models. Extensive experiments and analy… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted to Findings of EMNLP 2021

  5. arXiv:2102.08220  [pdf, other

    cs.CL

    Non-Autoregressive Text Generation with Pre-trained Language Models

    Authors: Yixuan Su, Deng Cai, Yan Wang, David Vandyke, Simon Baker, Piji Li, Nigel Collier

    Abstract: Non-autoregressive generation (NAG) has recently attracted great attention due to its fast inference speed. However, the generation quality of existing NAG models still lags behind their autoregressive counterparts. In this work, we show that BERT can be employed as the backbone of a NAG model to greatly improve performance. Additionally, we devise mechanisms to alleviate the two common problems o… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: Accepted to EACL 2021

  6. arXiv:2006.07499  [pdf, other

    cs.CL cs.AI

    A Generative Model for Joint Natural Language Understanding and Generation

    Authors: Bo-Hsiang Tseng, Jianpeng Cheng, Yimai Fang, David Vandyke

    Abstract: Natural language understanding (NLU) and natural language generation (NLG) are two fundamental and related tasks in building task-oriented dialogue systems with opposite objectives: NLU tackles the transformation from natural language to formal representations, whereas NLG does the reverse. A key to success in either task is parallel training data which is expensive to obtain at a large scale. In… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: The 58th Annual Meeting of the Association for Computational Linguistics, ACL2020

  7. arXiv:1609.02846  [pdf, other

    cs.CL

    Dialogue manager domain adaptation using Gaussian process reinforcement learning

    Authors: Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: Spoken dialogue systems allow humans to interact with machines using natural speech. As such, they have many benefits. By using speech as the primary communication medium, a computer interface can facilitate swift, human-like acquisition of information. In recent years, speech interfaces have become ever more popular, as is evident from the rise of personal assistants such as Siri, Google Now, Cor… ▽ More

    Submitted 9 September, 2016; originally announced September 2016.

    Comments: accepted for publication in Computer Speech and Language

  8. arXiv:1606.03352  [pdf, other

    cs.CL cs.NE stat.ML

    Conditional Generation and Snapshot Learning in Neural Dialogue Systems

    Authors: Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, David Vandyke, Steve Young

    Abstract: Recently a variety of LSTM-based conditional language models (LM) have been applied across a range of language generation tasks. In this work we study various model architectures and different ways to represent and aggregate the source information in an end-to-end neural dialogue system framework. A method called snapshot learning is also proposed to facilitate learning from supervised sequential… ▽ More

    Submitted 10 June, 2016; originally announced June 2016.

  9. arXiv:1606.02689  [pdf, other

    cs.CL cs.LG

    Continuously Learning Neural Dialogue Management

    Authors: Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A unified neural network framework is proposed to enable the system to first learn by supervision from a set of dialogue data and then continuously improve its behaviour via reinforcement learning, all using gradient-based algorithms on one single model. The experiments demonstrate the supervised mode… ▽ More

    Submitted 8 June, 2016; originally announced June 2016.

  10. arXiv:1605.07669  [pdf, other

    cs.CL cs.LG

    On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems

    Authors: Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: The ability to compute an accurate reward function is essential for optimising a dialogue policy via reinforcement learning. In real-world applications, using explicit user feedback as the reward signal is often unreliable and costly to collect. This problem can be mitigated if the user's intent is known in advance or data is available to pre-train a task success predictor off-line. In practice ne… ▽ More

    Submitted 2 June, 2016; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: Accepted as a long paper in ACL 2016

  11. arXiv:1604.04562  [pdf, other

    cs.CL cs.AI cs.NE stat.ML

    A Network-based End-to-End Trainable Task-oriented Dialogue System

    Authors: Tsung-Hsien Wen, David Vandyke, Nikola Mrksic, Milica Gasic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, Steve Young

    Abstract: Teaching machines to accomplish tasks by conversing naturally with humans is challenging. Currently, developing task-oriented dialogue systems requires creating multiple components and typically this involves either a large amount of handcrafting, or acquiring costly labelled datasets to solve a statistical learning problem for each component. In this work we introduce a neural network-based text-… ▽ More

    Submitted 24 April, 2017; v1 submitted 15 April, 2016; originally announced April 2016.

    Comments: published at EACL 2017

  12. arXiv:1603.01232  [pdf, other

    cs.CL

    Multi-domain Neural Network Language Generation for Spoken Dialogue Systems

    Authors: Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, David Vandyke, Steve Young

    Abstract: Moving from limited-domain natural language generation (NLG) to open domain is difficult because the number of semantic input combinations grows exponentially with the number of domains. Therefore, it is important to leverage existing resources and exploit similarities between domains to facilitate domain adaptation. In this paper, we propose a procedure to train multi-domain, Recurrent Neural Net… ▽ More

    Submitted 3 March, 2016; originally announced March 2016.

    Comments: Accepted as a long paper in NAACL-HLT 2016

  13. arXiv:1603.00892  [pdf, other

    cs.CL cs.LG

    Counter-fitting Word Vectors to Linguistic Constraints

    Authors: Nikola Mrkšić, Diarmuid Ó Séaghdha, Blaise Thomson, Milica Gašić, Lina Rojas-Barahona, Pei-Hao Su, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: In this work, we present a novel counter-fitting method which injects antonymy and synonymy constraints into vector space representations in order to improve the vectors' capability for judging semantic similarity. Applying this method to publicly available pre-trained word vectors leads to a new state of the art performance on the SimLex-999 dataset. We also show how the method can be used to tai… ▽ More

    Submitted 2 March, 2016; originally announced March 2016.

    Comments: Paper accepted for the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2016)

  14. arXiv:1508.03391  [pdf, other

    cs.LG cs.CL

    Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems

    Authors: Pei-Hao Su, David Vandyke, Milica Gasic, Nikola Mrksic, Tsung-Hsien Wen, Steve Young

    Abstract: Statistical spoken dialogue systems have the attractive property of being able to be optimised from data via interactions with real users. However in the reinforcement learning paradigm the dialogue manager (agent) often requires significant time to explore the state-action space to learn to behave in a desirable manner. This is a critical issue when the system is trained on-line with real users w… ▽ More

    Submitted 18 August, 2015; v1 submitted 13 August, 2015; originally announced August 2015.

    Comments: Accepted for publication in SigDial 2015

  15. arXiv:1508.03386  [pdf, other

    cs.LG cs.CL

    Learning from Real Users: Rating Dialogue Success with Neural Networks for Reinforcement Learning in Spoken Dialogue Systems

    Authors: Pei-Hao Su, David Vandyke, Milica Gasic, Dongho Kim, Nikola Mrksic, Tsung-Hsien Wen, Steve Young

    Abstract: To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for measuring task success is available. To date training has relied on presenting a task to either simulated or paid users and inferring the dialogue's success by observing whether this presented task was achieved or not. Our aim however is to be able to learn from real users acting under their own volitio… ▽ More

    Submitted 13 August, 2015; originally announced August 2015.

    Comments: Accepted for publication in INTERSPEECH 2015

  16. arXiv:1508.01755  [pdf, other

    cs.CL

    Stochastic Language Generation in Dialogue using Recurrent Neural Networks with Convolutional Sentence Reranking

    Authors: Tsung-Hsien Wen, Milica Gasic, Dongho Kim, Nikola Mrksic, Pei-Hao Su, David Vandyke, Steve Young

    Abstract: The natural language generation (NLG) component of a spoken dialogue system (SDS) usually needs a substantial amount of handcrafting or a well-labeled dataset to be trained on. These limitations add significantly to development costs and make cross-domain, multi-lingual dialogue systems intractable. Moreover, human languages are context-aware. The most natural response should be directly learned f… ▽ More

    Submitted 7 August, 2015; originally announced August 2015.

    Comments: To be appear in SigDial 2015

  17. arXiv:1508.01745  [pdf, other

    cs.CL

    Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems

    Authors: Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Pei-Hao Su, David Vandyke, Steve Young

    Abstract: Natural language generation (NLG) is a critical component of spoken dialogue and it has a significant impact both on usability and perceived quality. Most NLG systems in common use employ rules and heuristics and tend to generate rigid and stylised responses without the natural variation of human language. They are also not easily scaled to systems covering multiple domains and languages. This pap… ▽ More

    Submitted 26 August, 2015; v1 submitted 7 August, 2015; originally announced August 2015.

    Comments: To be appear in EMNLP 2015

  18. arXiv:1506.07190  [pdf, other

    cs.CL cs.LG

    Multi-domain Dialog State Tracking using Recurrent Neural Networks

    Authors: Nikola Mrkšić, Diarmuid Ó Séaghdha, Blaise Thomson, Milica Gašić, Pei-Hao Su, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: Dialog state tracking is a key component of many modern dialog systems, most of which are designed with a single, well-defined domain in mind. This paper shows that dialog data drawn from different dialog domains can be used to train a general belief tracking model which can operate across all of these domains, exhibiting superior performance to each of the domain-specific models. We propose a tra… ▽ More

    Submitted 23 June, 2015; originally announced June 2015.

    Comments: Accepted as a short paper in the 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015)