
Showing 1–23 of 23 results for author: Rojas-Barahona, L M

  1. arXiv:2407.02354  [pdf, other]

    cs.CL cs.AI cs.HC

    Talking to Machines: do you read me?

    Authors: Lina M. Rojas-Barahona

    Abstract: In this dissertation I would like to guide the reader through the research on dialogue, and more precisely the research I have conducted during my career since my PhD thesis, starting from modular architectures with machine learning/deep learning and reinforcement learning and moving to end-to-end deep neural networks. Besides my work as a research associate, I also present the work I have supervised in the last yea…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: French Doctoral Habilitation HDR manuscript: https://hal.science/tel-04620199

  2. arXiv:2406.12945  [pdf, other]

    cs.LG stat.ML

    Under the Hood of Tabular Data Generation Models: the Strong Impact of Hyperparameter Tuning

    Authors: G. Charbel N. Kindji, Lina Maria Rojas-Barahona, Elisa Fromont, Tanguy Urvoy

    Abstract: We investigate the impact of dataset-specific hyperparameter, feature encoding, and architecture tuning on five recent model families for tabular data generation through an extensive benchmark on 16 datasets. This study addresses the practical need for a unified evaluation of models that fully considers hyperparameter optimization. Additionally, we propose a reduced search space for each model tha…

    Submitted 12 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2404.07836  [pdf, ps, other]

    cs.CL

    Question Generation in Knowledge-Driven Dialog: Explainability and Evaluation

    Authors: Juliette Faille, Quentin Brabant, Gwenole Lecorve, Lina M. Rojas-Barahona, Claire Gardent

    Abstract: We explore question generation in the context of knowledge-grounded dialogs, focusing on explainability and evaluation. Inspired by previous work on planning-based summarisation, we present a model which, instead of directly generating a question, sequentially predicts first a fact and then a question. We evaluate our approach on 37k test dialogs adapted from the KGConv dataset and we show that, althoug…

    Submitted 11 April, 2024; originally announced April 2024.

  4. arXiv:2401.03905  [pdf, other]

    cs.CL

    WEBDial, a Multi-domain, Multitask Statistical Dialogue Framework with RDF

    Authors: Morgan Veyret, Jean-Baptiste Duchene, Kekeli Afonouvi, Quentin Brabant, Gwenole Lecorve, Lina M. Rojas-Barahona

    Abstract: Typically available dialogue frameworks have adopted a semantic representation based on dialogue-acts and slot-value pairs. Despite its simplicity, this representation has disadvantages such as the lack of expressivity, scalability and explainability. We present WEBDial: a dialogue framework that relies on a graph formalism by using RDF triples instead of slot-value pairs. We describe its overall…

    Submitted 8 January, 2024; originally announced January 2024.

  5. Unsupervised Auditory and Semantic Entrainment Models with Deep Neural Networks

    Authors: Jay Kejriwal, Stefan Benus, Lina M. Rojas-Barahona

    Abstract: Speakers tend to engage in adaptive behavior, known as entrainment, when they become similar to their interlocutor in various aspects of speaking. We present an unsupervised deep learning framework that derives meaningful representation from textual features for developing semantic entrainment. We investigate the model's performance by extracting features using different variations of the BERT mod…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Interspeech2023

  6. arXiv:2311.04922  [pdf, other]

    cs.CL cs.AI eess.AS eess.SP

    Are cascade dialogue state tracking models speaking out of turn in spoken dialogues?

    Authors: Lucas Druart, Léo Jacqmin, Benoît Favre, Lina Maria Rojas-Barahona, Valentin Vielzeuf

    Abstract: In Task-Oriented Dialogue (TOD) systems, correctly updating the system's understanding of the user's needs is key to a smooth interaction. Traditionally TOD systems are composed of several modules that interact with one another. While each of these components is the focus of active research communities, their behavior in interaction can be overlooked. This paper proposes a comprehensive analysis o…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Submitted to IEEE ICASSP 2024. © 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  7. arXiv:2308.15298  [pdf, other]

    cs.CL cs.AI

    KGConv, a Conversational Corpus grounded in Wikidata

    Authors: Quentin Brabant, Gwenole Lecorve, Lina M. Rojas-Barahona, Claire Gardent

    Abstract: We present KGConv, a large, conversational corpus of 71k conversations where each question-answer pair is grounded in a Wikidata fact. Conversations contain on average 8.6 questions and for each Wikidata fact, we provide multiple variants (12 on average) of the corresponding question using templates, human annotations, hand-crafted rules and a question rewriting neural model. We provide baselines…

    Submitted 29 August, 2023; originally announced August 2023.

  8. Interpreting Vision and Language Generative Models with Semantic Visual Priors

    Authors: Michele Cafagna, Lina M. Rojas-Barahona, Kees van Deemter, Albert Gatt

    Abstract: When applied to image-to-text models, interpretability methods often provide token-by-token explanations: namely, they compute a visual explanation for each token of the generated sequence. Those explanations are expensive to compute and unable to comprehensively explain the model's output. Therefore, these models often require some sort of approximation that eventually leads to misleading explanat…

    Submitted 4 May, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

  9. arXiv:2304.11073  [pdf, other]

    eess.AS cs.AI cs.CL cs.SD

    OLISIA: a Cascade System for Spoken Dialogue State Tracking

    Authors: Léo Jacqmin, Lucas Druart, Yannick Estève, Benoît Favre, Lina Maria Rojas-Barahona, Valentin Vielzeuf

    Abstract: Though Dialogue State Tracking (DST) is a core component of spoken dialogue systems, recent work on this task mostly deals with chat corpora, disregarding the discrepancies between spoken and written language. In this paper, we propose OLISIA, a cascade system which integrates an Automatic Speech Recognition (ASR) model and a DST model. We introduce several adaptations in the ASR and DST modules to…

    Submitted 31 August, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  10. arXiv:2302.11199  [pdf, other]

    cs.CL

    Few-Shot Structured Policy Learning for Multi-Domain and Multi-Task Dialogues

    Authors: Thibault Cordier, Tanguy Urvoy, Fabrice Lefevre, Lina M. Rojas-Barahona

    Abstract: Reinforcement learning has been widely adopted to model dialogue managers in task-oriented dialogues. However, the user simulators provided by state-of-the-art dialogue frameworks are only rough approximations of human behaviour. The ability to learn from a small number of human interactions is hence crucial, especially in multi-domain and multi-task environments where the action space is large. We…

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: 8 pages, at the EACL2023 conference (Findings)

  11. arXiv:2302.05900  [pdf, other]

    cs.CL

    Investigating the Effect of Relative Positional Embeddings on AMR-to-Text Generation with Structural Adapters

    Authors: Sebastien Montella, Alexis Nasr, Johannes Heinecke, Frederic Bechet, Lina M. Rojas-Barahona

    Abstract: Text generation from Abstract Meaning Representation (AMR) has substantially benefited from the popularized Pretrained Language Models (PLMs). Myriad approaches have linearized the input graph as a sequence of tokens to fit the PLM tokenization requirements. Nevertheless, this transformation jeopardizes the structural integrity of the graph and is therefore detrimental to its resulting representat…

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: Accepted to EACL 2023 (Main)

  12. arXiv:2210.05252  [pdf, other]

    cs.CL

    Graph Neural Network Policies and Imitation Learning for Multi-Domain Task-Oriented Dialogues

    Authors: Thibault Cordier, Tanguy Urvoy, Fabrice Lefèvre, Lina M. Rojas-Barahona

    Abstract: Task-oriented dialogue systems are designed to achieve specific goals while conversing with humans. In practice, they may have to handle simultaneously several domains and tasks. The dialogue manager must therefore be able to take into account domain changes and plan over different domains/tasks in order to deal with multidomain dialogues. However, learning with reinforcement in such context becom…

    Submitted 11 October, 2022; originally announced October 2022.

    Journal ref: SIGDIAL 2022

  13. arXiv:2207.14627  [pdf, other]

    cs.CL

    "Do you follow me?": A Survey of Recent Approaches in Dialogue State Tracking

    Authors: Léo Jacqmin, Lina M. Rojas-Barahona, Benoit Favre

    Abstract: While communicating with a user, a task-oriented dialogue system has to track the user's needs at each turn according to the conversation history. This process, called dialogue state tracking (DST), is crucial because it directly informs the downstream dialogue policy. DST has received a lot of interest in recent years, with the text-to-text paradigm emerging as the favored approach. In this review p…

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: SIGDIAL 2022

  14. arXiv:2207.03240  [pdf, ps, other]

    cs.CL

    CoQAR: Question Rewriting on CoQA

    Authors: Quentin Brabant, Gwenole Lecorve, Lina M. Rojas-Barahona

    Abstract: Questions asked by humans during a conversation often contain contextual dependencies, i.e., explicit or implicit references to previous dialogue turns. These dependencies take the form of coreferences (e.g., via pronoun use) or ellipses, and can make the understanding difficult for automated systems. One way to facilitate the understanding and subsequent treatments of a question is to rewrite it…

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: Published in LREC2022

  15. arXiv:2207.03145  [pdf, other]

    cs.CL

    Active Learning and Multi-label Classification for Ellipsis and Coreference Detection in Conversational Question-Answering

    Authors: Quentin Brabant, Lina Maria Rojas-Barahona, Claire Gardent

    Abstract: In human conversations, ellipsis and coreference are commonly occurring linguistic phenomena. Although these phenomena are a means of making human-machine conversations more fluent and natural, only a few dialogue corpora contain explicit indications of which turns contain ellipses and/or coreferences. In this paper we address the task of automatically detecting ellipsis and coreferences in conversat…

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: Published in IWSDS 2021

  16. arXiv:2101.05004  [pdf, other]

    cs.CL cs.AI cs.HC

    Is the User Enjoying the Conversation? A Case Study on the Impact on the Reward Function

    Authors: Lina M. Rojas-Barahona

    Abstract: The impact of user satisfaction in policy learning task-oriented dialogue systems has long been a subject of research interest. Most current models for estimating the user satisfaction either (i) treat out-of-context short-texts, such as product reviews, or (ii) rely on turn features instead of on distributed semantic representations. In this work we adopt deep neural networks that use distributed…

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: Accepted at the Human in the Loop Dialogue Systems Workshop, 34th Conference on Neural Information Processing Systems (NeurIPS 2020). Paper updated with minor changes

  17. arXiv:2012.04687  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation

    Authors: Thibault Cordier, Tanguy Urvoy, Lina M. Rojas-Barahona, Fabrice Lefèvre

    Abstract: A learning dialogue agent can infer its behaviour from interactions with the users. These interactions can be taken from either human-to-human or human-machine conversations. However, human interactions are scarce and costly, making learning from few interactions essential. One solution to speed up the learning process is to guide the agent's exploration with the help of an expert. We present in th…

    Submitted 25 November, 2020; originally announced December 2020.

    Comments: 8 pages, Accepted at Human in the Loop Dialogue Systems Workshop, NeurIPS 2020

  18. arXiv:1909.11980  [pdf, other]

    cs.AI cs.CL cs.HC

    Spoken Conversational Search for General Knowledge

    Authors: Lina M. Rojas-Barahona, Pascal Bellec, Benoit Besset, Martinho Dos-Santos, Johannes Heinecke, Munshi Asadullah, Olivier Le-Blouch, Jean Y. Lancien, Géraldine Damnati, Emmanuel Mory, Frédéric Herledan

    Abstract: We present a spoken conversational question answering proof of concept that is able to answer questions about general knowledge from Wikidata. The dialogue component not only orchestrates various components but also resolves coreferences and ellipses.

    Submitted 26 September, 2019; originally announced September 2019.

    Comments: SIGDial2019

  19. arXiv:1806.05484  [pdf, other]

    cs.CL cs.AI

    Nearly Zero-Shot Learning for Semantic Decoding in Spoken Dialogue Systems

    Authors: Lina M. Rojas-Barahona, Stefan Ultes, Pawel Budzianowski, Iñigo Casanueva, Milica Gasic, Bo-Hsiang Tseng, Steve Young

    Abstract: This paper presents two ways of dealing with scarce data in semantic decoding using N-Best speech recognition hypotheses. First, we learn features by using a deep learning architecture in which the weights for the unknown and known categories are jointly optimised. Second, an unsupervised method is used for further tuning the weights. Sharing weights injects prior knowledge to unknown categories.…

    Submitted 21 June, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

  20. arXiv:1609.02846  [pdf, other]

    cs.CL

    Dialogue manager domain adaptation using Gaussian process reinforcement learning

    Authors: Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: Spoken dialogue systems allow humans to interact with machines using natural speech. As such, they have many benefits. By using speech as the primary communication medium, a computer interface can facilitate swift, human-like acquisition of information. In recent years, speech interfaces have become ever more popular, as is evident from the rise of personal assistants such as Siri, Google Now, Cor…

    Submitted 9 September, 2016; originally announced September 2016.

    Comments: accepted for publication in Computer Speech and Language

  21. arXiv:1606.03352  [pdf, other]

    cs.CL cs.NE stat.ML

    Conditional Generation and Snapshot Learning in Neural Dialogue Systems

    Authors: Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, David Vandyke, Steve Young

    Abstract: Recently a variety of LSTM-based conditional language models (LM) have been applied across a range of language generation tasks. In this work we study various model architectures and different ways to represent and aggregate the source information in an end-to-end neural dialogue system framework. A method called snapshot learning is also proposed to facilitate learning from supervised sequential…

    Submitted 10 June, 2016; originally announced June 2016.

  22. arXiv:1604.04562  [pdf, other]

    cs.CL cs.AI cs.NE stat.ML

    A Network-based End-to-End Trainable Task-oriented Dialogue System

    Authors: Tsung-Hsien Wen, David Vandyke, Nikola Mrksic, Milica Gasic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, Steve Young

    Abstract: Teaching machines to accomplish tasks by conversing naturally with humans is challenging. Currently, developing task-oriented dialogue systems requires creating multiple components and typically this involves either a large amount of handcrafting, or acquiring costly labelled datasets to solve a statistical learning problem for each component. In this work we introduce a neural network-based text-…

    Submitted 24 April, 2017; v1 submitted 15 April, 2016; originally announced April 2016.

    Comments: published at EACL 2017

  23. arXiv:1603.01232  [pdf, other]

    cs.CL

    Multi-domain Neural Network Language Generation for Spoken Dialogue Systems

    Authors: Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, David Vandyke, Steve Young

    Abstract: Moving from limited-domain natural language generation (NLG) to open domain is difficult because the number of semantic input combinations grows exponentially with the number of domains. Therefore, it is important to leverage existing resources and exploit similarities between domains to facilitate domain adaptation. In this paper, we propose a procedure to train multi-domain, Recurrent Neural Net…

    Submitted 3 March, 2016; originally announced March 2016.

    Comments: Accepted as a long paper in NAACL-HLT 2016