Showing 1–10 of 10 results for author: Gagne, C

  1. arXiv:2403.14048

    cs.SD cs.CL eess.AS

    The NeurIPS 2023 Machine Learning for Audio Workshop: Affective Audio Benchmarks and Novel Data

    Authors: Alice Baird, Rachel Manzelli, Panagiotis Tzirakis, Chris Gagne, Haoqi Li, Sadie Allen, Sander Dieleman, Brian Kulis, Shrikanth S. Narayanan, Alan Cowen

    Abstract: The NeurIPS 2023 Machine Learning for Audio Workshop brings together machine learning (ML) experts from various audio domains. There are several valuable audio-driven ML tasks, from speech emotion recognition to audio event detection, but the community is sparse compared to other ML areas, e.g., computer vision or natural language processing. A major limitation with audio is the available data; wi…

    Submitted 20 March, 2024; originally announced March 2024.

  2. arXiv:2403.09828

    eess.IV cs.CV

    Analyzing Data Augmentation for Medical Images: A Case Study in Ultrasound Images

    Authors: Adam Tupper, Christian Gagné

    Abstract: Data augmentation is one of the most effective techniques to improve the generalization performance of deep neural networks. Yet, despite often facing limited data availability in medical image analysis, it is frequently underutilized. This appears to be due to a gap in our collective understanding of the efficacy of different augmentation techniques across medical imaging tasks and modalities. On…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: For associated code, see https://github.com/adamtupper/medical-image-augmentation

  3. arXiv:2312.05357

    eess.IV cs.CV

    Filtering Pixel Latent Variables for Unmixing Noisy and Undersampled Volumetric Images

    Authors: Catherine Bouchard, Andréanne Deschênes, Vincent Boulanger, Jean-Michel Bellavance, Flavie Lavoie-Cardinal, Christian Gagné

    Abstract: The development of robust signal unmixing algorithms is essential for leveraging multimodal datasets acquired through a wide array of scientific imaging technologies, including hyperspectral or time-resolved acquisitions. In experimental physics, enhancing the spatio-temporal resolution or expanding the number of detection channels often leads to diminished sampling rate and signal-to-noise ratio,…

    Submitted 5 April, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: 16 pages, 8 figures (main paper) + 18 pages, 9 figures (supplementary material)

  4. arXiv:2305.05023

    eess.IV cs.CV cs.LG

    Domain Agnostic Image-to-image Translation using Low-Resolution Conditioning

    Authors: Mohamed Abid, Arman Afrasiyabi, Ihsen Hedhli, Jean-François Lalonde, Christian Gagné

    Abstract: Generally, image-to-image translation (i2i) methods aim at learning mappings across domains with the assumption that the images used for translation share content (e.g., pose) but have their own domain-specific information (a.k.a. style). Conditioned on a target image, such methods extract the target style and combine it with the source image content, keeping coherence between the domains. In our…

    Submitted 10 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: 19 pages, 23 figures. arXiv admin note: substantial text overlap with arXiv:2107.11262. Under consideration in Computer Vision and Image Understanding

  5. arXiv:2304.14882

    cs.SD cs.LG eess.AS

    The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share & Requests

    Authors: Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Alexander Barnhill, Maurice Gerczuk, Andreas Triantafyllopoulos, Alice Baird, Panagiotis Tzirakis, Chris Gagne, Alan S. Cowen, Nikola Lackovic, Marie-José Caraty, Claude Montacié

    Abstract: The ACM Multimedia 2023 Computational Paralinguistics Challenge addresses two different problems for the first time in a research competition under well-defined conditions: In the Emotion Share Sub-Challenge, a regression on speech has to be made; and in the Requests Sub-Challenges, requests and complaints need to be detected. We describe the Sub-Challenges, baseline feature extraction, and classi…

    Submitted 1 May, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

    Comments: 5 pages, part of the ACM Multimedia 2023 Grand Challenge "The ACM Multimedia 2023 Computational Paralinguistics Challenge (ComParE 2023)". arXiv admin note: text overlap with arXiv:2205.06799

    MSC Class: 68; ACM Class: I.2.7; I.5.0; J.3

  6. arXiv:2107.11262

    cs.CV eess.IV

    Image-to-Image Translation with Low Resolution Conditioning

    Authors: Mohamed Abderrahmen Abid, Ihsen Hedhli, Jean-François Lalonde, Christian Gagne

    Abstract: Most image-to-image translation methods focus on learning mappings across domains with the assumption that images share content (e.g., pose) but have their own domain-specific information known as style. When conditioned on a target image, such methods aim to extract the style of the target and combine it with the content of the source image. In this work, we consider the scenario where the target…

    Submitted 23 July, 2021; originally announced July 2021.

  7. arXiv:2106.12628

    cs.CV eess.IV

    Florida Wildlife Camera Trap Dataset

    Authors: Crystal Gagne, Jyoti Kini, Daniel Smith, Mubarak Shah

    Abstract: Trail camera imagery has increasingly gained popularity amongst biologists for conservation and ecological research. Minimal human interference required to operate camera traps allows capturing unbiased species activities. Several studies - based on human and wildlife interactions, migratory patterns of various species, risk of extinction in endangered populations - are limited by the lack of rich…

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition, CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling Workshop, 2021

  8. arXiv:2102.06624

    eess.IV cs.CV

    A Generative Model for Hallucinating Diverse Versions of Super Resolution Images

    Authors: Mohamed Abderrahmen Abid, Ihsen Hedhli, Christian Gagné

    Abstract: Traditionally, the main focus of image super-resolution techniques is on recovering the most likely high-quality images from low-quality images, using a one-to-one low- to high-resolution mapping. Proceeding that way, we ignore the fact that there are generally many valid versions of high-resolution images that map to a given low-resolution image. We are tackling in this work the problem of obtain…

    Submitted 12 February, 2021; originally announced February 2021.

  9. arXiv:2002.02852

    cs.CV cs.LG eess.IV

    Input Dropout for Spatially Aligned Modalities

    Authors: Sébastien de Blois, Mathieu Garon, Christian Gagné, Jean-François Lalonde

    Abstract: Computer vision datasets containing multiple modalities such as color, depth, and thermal properties are now commonly accessible and useful for solving a wide array of challenging tasks. However, deploying multi-sensor heads is not possible in many scenarios. As such many practical solutions tend to be based on simpler sensors, mostly for cost, simplicity and robustness considerations. In this wor…

    Submitted 21 May, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

    Comments: Accepted in ICIP 2020. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  10. Learning of Image Dehazing Models for Segmentation Tasks

    Authors: Sébastien de Blois, Ihsen Hedhli, Christian Gagné

    Abstract: To evaluate their performance, existing dehazing approaches generally rely on distance measures between the generated image and its corresponding ground truth. Despite its ability to produce visually good images, using pixel-based or even perceptual metrics do not guarantee, in general, that the produced image is fit for being used as input for low-level computer vision tasks such as segmentation.…

    Submitted 22 June, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

    Comments: Accepted in EUSIPCO 2019