Skip to main content

Showing 1–8 of 8 results for author: Bordes, P

  1. arXiv:2406.14150  [pdf, other

    cs.LG

    Multi-modal Transfer Learning between Biological Foundation Models

    Authors: Juan Jose Garau-Luis, Patrick Bordes, Liam Gonzalez, Masa Roller, Bernardo P. de Almeida, Lorenz Hexemer, Christopher Blum, Stefan Laurent, Jan Grzegorzewski, Maren Lang, Thomas Pierrot, Guillaume Richard

    Abstract: Biological sequences encode fundamental instructions for the building blocks of life, in the form of DNA, RNA, and proteins. Modeling these sequences is key to understand disease mechanisms and is an active research area in computational biology. Recently, Large Language Models have shown great promise in solving certain biological tasks but current approaches are limited to a single sequence moda… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    MSC Class: 68T07 (Primary)

  2. arXiv:2202.03149  [pdf, other

    eess.IV cs.LG

    Neural Network based Inter bi-prediction Blending

    Authors: Franck Galpin, Philippe Bordes, Thierry Dumas, Pavel Nikitin, Fabrice Le Leannec

    Abstract: This paper presents a learning-based method to improve bi-prediction in video coding. In conventional video coding solutions, the motion compensation of blocks from already decoded reference pictures stands out as the principal tool used to predict the current frame. Especially, the bi-prediction, in which a block is obtained by averaging two different motion-compensated prediction blocks, signifi… ▽ More

    Submitted 26 January, 2022; originally announced February 2022.

    Journal ref: VCIP 2021

  3. arXiv:2011.06850  [pdf, other

    cs.CV cs.AI

    Transductive Zero-Shot Learning using Cross-Modal CycleGAN

    Authors: Patrick Bordes, Eloi Zablocki, Benjamin Piwowarski, Patrick Gallinari

    Abstract: In Computer Vision, Zero-Shot Learning (ZSL) aims at classifying unseen classes -- classes for which no matching training image exists. Most of ZSL works learn a cross-modal mapping between images and class labels for seen classes. However, the data distribution of seen and unseen classes might differ, causing a domain shift problem. Following this observation, transductive ZSL (T-ZSL) assumes tha… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

  4. CNN-based driving of block partitioning for intra slices encoding

    Authors: Franck Galpin, Fabien Racapé, Sunil Jaiswal, Philippe Bordes, Fabrice Le Léannec, Edouard François

    Abstract: This paper provides a technical overview of a deep-learning-based encoder method aiming at optimizing next generation hybrid video encoders for driving the block partitioning in intra slices. An encoding approach based on Convolutional Neural Networks is explored to partly substitute classical heuristics-based encoder speed-ups by a systematic and automatic process. The solution allows controlling… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: 10 pages

    Journal ref: 2019 Data Compression Conference (DCC)

  5. arXiv:2003.06812  [pdf, other

    eess.IV cs.CV cs.LG

    Iterative training of neural networks for intra prediction

    Authors: Thierry Dumas, Franck Galpin, Philippe Bordes

    Abstract: This paper presents an iterative training of neural networks for intra prediction in a block-based image and video codec. First, the neural networks are trained on blocks arising from the codec partitioning of images, each paired with its context. Then, iteratively, blocks are collected from the partitioning of images via the codec including the neural networks trained at the previous iteration, e… ▽ More

    Submitted 25 November, 2020; v1 submitted 15 March, 2020; originally announced March 2020.

    Comments: 15 pages, 16 figures

  6. arXiv:2002.10832  [pdf, other

    cs.CL cs.CV cs.LG

    What BERT Sees: Cross-Modal Transfer for Visual Question Generation

    Authors: Thomas Scialom, Patrick Bordes, Paul-Alexis Dray, Jacopo Staiano, Patrick Gallinari

    Abstract: Pre-trained language models have recently contributed to significant advances in NLP tasks. Recently, multi-modal versions of BERT have been developed, using heavy pre-training relying on vast corpora of aligned textual and image data, primarily applied to classification tasks such as VQA. In this paper, we are interested in evaluating the visual capabilities of BERT out-of-the-box, by avoiding pr… ▽ More

    Submitted 16 December, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: INLG 2020

  7. arXiv:2002.02734  [pdf, other

    cs.CL

    Incorporating Visual Semantics into Sentence Representations within a Grounded Space

    Authors: Patrick Bordes, Eloi Zablocki, Laure Soulier, Benjamin Piwowarski, Patrick Gallinari

    Abstract: Language grounding is an active field aiming at enriching textual representations with visual information. Generally, textual and visual elements are embedded in the same representation space, which implicitly assumes a one-to-one correspondence between modalities. This hypothesis does not hold when representing words, and becomes problematic when used to learn sentence representations --- the foc… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

  8. arXiv:1904.12638  [pdf, other

    cs.CV cs.CL cs.LG stat.ML

    Context-Aware Zero-Shot Learning for Object Recognition

    Authors: Eloi Zablocki, Patrick Bordes, Benjamin Piwowarski, Laure Soulier, Patrick Gallinari

    Abstract: Zero-Shot Learning (ZSL) aims at classifying unlabeled objects by leveraging auxiliary knowledge, such as semantic representations. A limitation of previous approaches is that only intrinsic properties of objects, e.g. their visual appearance, are taken into account while their context, e.g. the surrounding objects in the image, is ignored. Following the intuitive principle that objects tend to be… ▽ More

    Submitted 30 April, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

    Comments: Accepted at ICML 2019