Showing 1–2 of 2 results for author: Fernández, M P

  1. arXiv:2406.09940  [pdf, other]

    q-bio.NC cs.AI cs.NE

    Implementing engrams from a machine learning perspective: XOR as a basic motif

    Authors: Jesus Marco de Lucas, Maria Peña Fernandez, Lara Lloret Iglesias

    Abstract: We have previously presented the idea of how complex multimodal information could be represented in our brains in a compressed form, following mechanisms similar to those employed in machine learning tools, like autoencoders. In this short comment note we reflect, mainly with a didactical purpose, upon the basic question for a biological implementation: what could be the mechanism working as a los…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 9 pages, short comment

  2. arXiv:2210.02833  [pdf, other]

    cs.IR cs.CL cs.LG cs.SD eess.AS

    Matching Text and Audio Embeddings: Exploring Transfer-learning Strategies for Language-based Audio Retrieval

    Authors: Benno Weck, Miguel Pérez Fernández, Holger Kirchhoff, Xavier Serra

    Abstract: We present an analysis of large-scale pretrained deep learning models used for cross-modal (text-to-audio) retrieval. We use embeddings extracted by these models in a metric learning framework to connect matching pairs of audio and text. Shallow neural networks map the embeddings to a common dimensionality. Our system, which is an extension of our submission to the Language-based Audio Retrieval T…

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: 5 pages, 2 figures. Accepted at Detection and Classification of Acoustic Scenes and Events 2022 (DCASE2022)