Skip to main content

Showing 1–31 of 31 results for author: Dumoulin, V

  1. arXiv:2406.19975  [pdf, other

    cs.CR

    On the Response Entropy of APUFs

    Authors: Vincent Dumoulin, Wenjing Rao, Natasha Devroye

    Abstract: A Physically Unclonable Function (PUF) is a hardware security primitive used for authentication and key generation. It takes an input bit-vector challenge and produces a single-bit response, resulting in a challenge-response pair (CRP). The truth table of all challenge-response pairs of each manufactured PUF should look different due to inherent manufacturing randomness, forming a digital fingerpr… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.09073  [pdf, other

    cs.LG

    Are we making progress in unlearning? Findings from the first NeurIPS unlearning competition

    Authors: Eleni Triantafillou, Peter Kairouz, Fabian Pedregosa, Jamie Hayes, Meghdad Kurmanji, Kairan Zhao, Vincent Dumoulin, Julio Jacques Junior, Ioannis Mitliagkas, Jun Wan, Lisheng Sun Hosoya, Sergio Escalera, Gintare Karolina Dziugaite, Peter Triantafillou, Isabelle Guyon

    Abstract: We present the findings of the first NeurIPS competition on unlearning, which sought to stimulate the development of novel algorithms and initiate discussions on formal and robust evaluation methodologies. The competition was highly successful: nearly 1,200 teams from across the world participated, and a wealth of novel, imaginative solutions with different characteristics were contributed. In thi… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2405.10425  [pdf, other

    cs.LG

    Data Selection for Transfer Unlearning

    Authors: Nazanin Mohammadi Sepahvand, Vincent Dumoulin, Eleni Triantafillou, Gintare Karolina Dziugaite

    Abstract: As deep learning models are becoming larger and data-hungrier, there are growing ethical, legal and technical concerns over use of data: in practice, agreements on data use may change over time, rendering previously-used training data impermissible for training purposes. These issues have driven increased attention to machine unlearning: removing "the influence of" a subset of training data from a… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  4. arXiv:2404.16436  [pdf

    cs.SD cs.AI cs.LG eess.AS

    Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics

    Authors: Ben Williams, Bart van Merriënboer, Vincent Dumoulin, Jenny Hamer, Eleni Triantafillou, Abram B. Fleishman, Matthew McKown, Jill E. Munger, Aaron N. Rice, Ashlee Lillis, Clemency E. White, Catherine A. D. Hobbs, Tries B. Razak, Kate E. Jones, Tom Denton

    Abstract: Machine learning has the potential to revolutionize passive acoustic monitoring (PAM) for ecological assessments. However, high annotation and compute costs limit the field's efficacy. Generalizable pretrained networks can overcome these costs, but high-quality pretraining requires vast annotated libraries, limiting its current applicability primarily to bird taxa. Here, we identify the optimum pr… ▽ More

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 18 pages, 5 figures

  5. arXiv:2312.07439  [pdf, other

    cs.LG

    BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics

    Authors: Jenny Hamer, Eleni Triantafillou, Bart van Merriënboer, Stefan Kahl, Holger Klinck, Tom Denton, Vincent Dumoulin

    Abstract: The ability for a machine learning model to cope with differences in training and deployment conditions--e.g. in the presence of distribution shift or the generalization to new classes altogether--is crucial for real-world use cases. However, most empirical work in this area has focused on the image domain with artificial benchmarks constructed to measure individual aspects of generalization. We p… ▽ More

    Submitted 13 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

  6. arXiv:2311.14115  [pdf, other

    cs.LG cs.AI cs.CL

    A density estimation perspective on learning from pairwise human preferences

    Authors: Vincent Dumoulin, Daniel D. Johnson, Pablo Samuel Castro, Hugo Larochelle, Yann Dauphin

    Abstract: Learning from human feedback (LHF) -- and in particular learning from pairwise preferences -- has recently become a crucial ingredient in training large language models (LLMs), and has been the subject of much research. Most recent works frame it as a reinforcement learning problem, where a reward function is learned from pairwise preference data and the LLM is treated as a policy which is adapted… ▽ More

    Submitted 10 January, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

  7. arXiv:2308.13645  [pdf, other

    cs.CR cs.LG

    Active learning for fast and slow modeling attacks on Arbiter PUFs

    Authors: Vincent Dumoulin, Wenjing Rao, Natasha Devroye

    Abstract: Modeling attacks, in which an adversary uses machine learning techniques to model a hardware-based Physically Unclonable Function (PUF) pose a great threat to the viability of these hardware security primitives. In most modeling attacks, a random subset of challenge-response-pairs (CRPs) are used as the labeled data for the machine learning algorithm. Here, for the arbiter-PUF, a delay based PUF w… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  8. arXiv:2302.06658  [pdf, other

    cs.LG

    In Search for a Generalizable Method for Source Free Domain Adaptation

    Authors: Malik Boudiaf, Tom Denton, Bart van Merriënboer, Vincent Dumoulin, Eleni Triantafillou

    Abstract: Source-free domain adaptation (SFDA) is compelling because it allows adapting an off-the-shelf model to a new domain using only unlabelled data. In this work, we apply existing SFDA techniques to a challenging set of naturally-occurring distribution shifts in bioacoustics, which are very different from the ones commonly studied in computer vision. We find existing methods perform differently relat… ▽ More

    Submitted 24 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  9. arXiv:2204.00484  [pdf, other

    cs.CV cs.LG

    Proper Reuse of Image Classification Features Improves Object Detection

    Authors: Cristina Vasconcelos, Vighnesh Birodkar, Vincent Dumoulin

    Abstract: A common practice in transfer learning is to initialize the downstream model weights by pre-training on a data-abundant upstream task. In object detection specifically, the feature backbone is typically initialized with Imagenet classifier weights and fine-tuned on the object detection task. Recent works show this is not strictly necessary under longer training regimes and provide recipes for trai… ▽ More

    Submitted 27 June, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Journal ref: CVPR 2022

  10. arXiv:2201.03529  [pdf, other

    cs.LG cs.CV

    Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning

    Authors: Utku Evci, Vincent Dumoulin, Hugo Larochelle, Michael C. Mozer

    Abstract: Transfer-learning methods aim to improve performance in a data-scarce target domain using a model pretrained on a data-rich source domain. A cost-efficient strategy, linear probing, involves freezing the source model and training a new classification head for the target domain. This strategy is outperformed by a more costly but state-of-the-art method -- fine-tuning all parameters of the source mo… ▽ More

    Submitted 25 July, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: presented at ICML 2022 (Oral)

    Journal ref: ICML 2022, Proceedings of the 39th International Conference on Machine Learning

  11. arXiv:2108.03489  [pdf, other

    cs.CV cs.LG

    Impact of Aliasing on Generalization in Deep Convolutional Networks

    Authors: Cristina Vasconcelos, Hugo Larochelle, Vincent Dumoulin, Rob Romijnders, Nicolas Le Roux, Ross Goroshin

    Abstract: We investigate the impact of aliasing on generalization in Deep Convolutional Networks and show that data augmentation schemes alone are unable to prevent it due to structural limitations in widely used architectures. Drawing insights from frequency analysis theory, we take a closer look at ResNet and EfficientNet architectures and review the trade-off between aliasing and information loss in each… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021. arXiv admin note: text overlap with arXiv:2011.10675

  12. arXiv:2106.13899  [pdf, other

    cs.LG cs.AI cs.CV

    Domain Conditional Predictors for Domain Adaptation

    Authors: Joao Monteiro, Xavier Gibert, Jianqiao Feng, Vincent Dumoulin, Dar-Shyang Lee

    Abstract: Learning guarantees often rely on assumptions of i.i.d. data, which will likely be violated in practice once predictors are deployed to perform real-world tasks. Domain adaptation approaches thus appeared as a useful framework yielding extra flexibility in that distinct train and test data distributions are supported, provided that other assumptions are satisfied such as covariate shift, which exp… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: Part of the pre-registration workshop at NeurIPS 2020: https://preregister.science/

  13. arXiv:2105.07029  [pdf, other

    cs.LG cs.CV

    Learning a Universal Template for Few-shot Dataset Generalization

    Authors: Eleni Triantafillou, Hugo Larochelle, Richard Zemel, Vincent Dumoulin

    Abstract: Few-shot dataset generalization is a challenging variant of the well-studied few-shot classification problem where a diverse training set of several datasets is given, for the purpose of training an adaptable model that can then learn classes from new datasets using only a few examples. To this end, we propose to utilize the diverse training set to construct a universal template: a partial model t… ▽ More

    Submitted 21 June, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

  14. arXiv:2104.02638  [pdf, other

    cs.LG cs.CV

    Comparing Transfer and Meta Learning Approaches on a Unified Few-Shot Classification Benchmark

    Authors: Vincent Dumoulin, Neil Houlsby, Utku Evci, Xiaohua Zhai, Ross Goroshin, Sylvain Gelly, Hugo Larochelle

    Abstract: Meta and transfer learning are two successful families of approaches to few-shot learning. Despite highly related goals, state-of-the-art advances in each family are measured largely in isolation of each other. As a result of diverging evaluation norms, a direct or thorough comparison of different approaches is challenging. To bridge this gap, we perform a cross-family study of the best transfer a… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  15. arXiv:2011.10675  [pdf, other

    cs.CV

    An Effective Anti-Aliasing Approach for Residual Networks

    Authors: Cristina Vasconcelos, Hugo Larochelle, Vincent Dumoulin, Nicolas Le Roux, Ross Goroshin

    Abstract: Image pre-processing in the frequency domain has traditionally played a vital role in computer vision and was even part of the standard pipeline in the early days of deep learning. However, with the advent of large datasets, many practitioners concluded that this was unnecessary due to the belief that these priors can be learned from the data itself. Frequency aliasing is a phenomenon that may occ… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  16. arXiv:1903.03096  [pdf, other

    cs.LG stat.ML

    Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples

    Authors: Eleni Triantafillou, Tyler Zhu, Vincent Dumoulin, Pascal Lamblin, Utku Evci, Kelvin Xu, Ross Goroshin, Carles Gelada, Kevin Swersky, Pierre-Antoine Manzagol, Hugo Larochelle

    Abstract: Few-shot classification refers to learning a classifier for new classes given only a few examples. While a plethora of models have emerged to tackle it, we find the procedure and datasets that are used to assess their progress lacking. To address this limitation, we propose Meta-Dataset: a new benchmark for training and evaluating models that is large-scale, consists of diverse datasets, and prese… ▽ More

    Submitted 8 April, 2020; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: Code available at https://github.com/google-research/meta-dataset

    Journal ref: International Conference on Learning Representations (2020)

  17. The Hanabi Challenge: A New Frontier for AI Research

    Authors: Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling

    Abstract: From the early days of computing, games have been important testbeds for studying how well machines can do sophisticated decision making. In recent years, machine learning has made dramatic advances with artificial agents reaching superhuman performance in challenge domains like Go, Atari, and some variants of poker. As with their predecessors of chess, checkers, and backgammon, these game domains… ▽ More

    Submitted 6 December, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: 32 pages, 5 figures, In Press (Artificial Intelligence)

  18. arXiv:1812.00273  [pdf, other

    cs.LG cs.CV stat.ML

    Cross-Modulation Networks for Few-Shot Learning

    Authors: Hugo Prol, Vincent Dumoulin, Luis Herranz

    Abstract: A family of recent successful approaches to few-shot learning relies on learning an embedding space in which predictions are made by computing similarities between examples. This corresponds to combining information between support and query examples at a very late stage of the prediction pipeline. Inspired by this observation, we hypothesize that there may be benefits to combining the information… ▽ More

    Submitted 1 December, 2018; originally announced December 2018.

    Comments: Accepted at NIPS 2018 Workshop on Meta-Learning. Source code available at https://github.com/hprop/cross-modulation-nets

  19. Generative Adversarial Networks: An Overview

    Authors: Antonia Creswell, Tom White, Vincent Dumoulin, Kai Arulkumaran, Biswa Sengupta, Anil A Bharath

    Abstract: Generative adversarial networks (GANs) provide a way to learn deep representations without extensively annotated training data. They achieve this through deriving backpropagation signals through a competitive process involving a pair of networks. The representations that can be learned by GANs may be used in a variety of applications, including image synthesis, semantic image editing, style transf… ▽ More

    Submitted 19 October, 2017; originally announced October 2017.

    Comments: Accepted in the IEEE Signal Processing Magazine Special Issue on Deep Learning for Visual Understanding

  20. arXiv:1709.07871  [pdf, other

    cs.CV cs.AI cs.CL stat.ML

    FiLM: Visual Reasoning with a General Conditioning Layer

    Authors: Ethan Perez, Florian Strub, Harm de Vries, Vincent Dumoulin, Aaron Courville

    Abstract: We introduce a general-purpose conditioning method for neural networks called FiLM: Feature-wise Linear Modulation. FiLM layers influence neural network computation via a simple, feature-wise affine transformation based on conditioning information. We show that FiLM layers are highly effective for visual reasoning - answering image-related questions which require a multi-step, high-level process -… ▽ More

    Submitted 18 December, 2017; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: AAAI 2018. Code available at http://github.com/ethanjperez/film . Extends arXiv:1707.03017

  21. arXiv:1707.03017  [pdf, other

    cs.CV cs.AI cs.CL stat.ML

    Learning Visual Reasoning Without Strong Priors

    Authors: Ethan Perez, Harm de Vries, Florian Strub, Vincent Dumoulin, Aaron Courville

    Abstract: Achieving artificial visual reasoning - the ability to answer image-related questions which require a multi-step, high-level process - is an important step towards artificial general intelligence. This multi-modal task requires learning a question-dependent, structured reasoning process over images from language. Standard deep learning approaches tend to exploit biases in the data rather than lear… ▽ More

    Submitted 18 December, 2017; v1 submitted 10 July, 2017; originally announced July 2017.

    Comments: Full AAAI 2018 paper is at arXiv:1709.07871. Presented at ICML 2017's Machine Learning in Speech and Language Processing Workshop. Code is at http://github.com/ethanjperez/film

  22. arXiv:1705.06830  [pdf, other

    cs.CV

    Exploring the structure of a real-time, arbitrary neural artistic stylization network

    Authors: Golnaz Ghiasi, Honglak Lee, Manjunath Kudlur, Vincent Dumoulin, Jonathon Shlens

    Abstract: In this paper, we present a method which combines the flexibility of the neural algorithm of artistic style with the speed of fast style transfer networks to allow real-time stylization using any content/style image pair. We build upon recent work leveraging conditional instance normalization for multi-style transfer networks by learning to predict the conditional instance normalization parameters… ▽ More

    Submitted 24 August, 2017; v1 submitted 18 May, 2017; originally announced May 2017.

    Comments: Accepted as an oral presentation at British Machine Vision Conference (BMVC) 2017

  23. arXiv:1704.00028  [pdf, other

    cs.LG stat.ML

    Improved Training of Wasserstein GANs

    Authors: Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, Aaron Courville

    Abstract: Generative Adversarial Networks (GANs) are powerful generative models, but suffer from training instability. The recently proposed Wasserstein GAN (WGAN) makes progress toward stable training of GANs, but sometimes can still generate only low-quality samples or fail to converge. We find that these problems are often due to the use of weight clipping in WGAN to enforce a Lipschitz constraint on the… ▽ More

    Submitted 25 December, 2017; v1 submitted 31 March, 2017; originally announced April 2017.

    Comments: NIPS camera-ready

  24. arXiv:1610.07629  [pdf, other

    cs.CV cs.LG

    A Learned Representation For Artistic Style

    Authors: Vincent Dumoulin, Jonathon Shlens, Manjunath Kudlur

    Abstract: The diversity of painting styles represents a rich visual vocabulary for the construction of an image. The degree to which one may learn and parsimoniously capture this visual vocabulary measures our understanding of the higher level features of paintings, if not images in general. In this work we investigate the construction of a single, scalable deep network that can parsimoniously capture the a… ▽ More

    Submitted 9 February, 2017; v1 submitted 24 October, 2016; originally announced October 2016.

    Comments: 9 pages. 15 pages of Appendix, International Conference on Learning Representations (ICLR) 2017

  25. arXiv:1606.00704  [pdf, other

    stat.ML cs.LG

    Adversarially Learned Inference

    Authors: Vincent Dumoulin, Ishmael Belghazi, Ben Poole, Olivier Mastropietro, Alex Lamb, Martin Arjovsky, Aaron Courville

    Abstract: We introduce the adversarially learned inference (ALI) model, which jointly learns a generation network and an inference network using an adversarial process. The generation network maps samples from stochastic latent variables to the data space while the inference network maps training examples in data space to the space of latent variables. An adversarial game is cast between these two networks… ▽ More

    Submitted 21 February, 2017; v1 submitted 2 June, 2016; originally announced June 2016.

  26. arXiv:1605.02688  [pdf, other

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  27. arXiv:1603.07285  [pdf, other

    stat.ML cs.LG cs.NE

    A guide to convolution arithmetic for deep learning

    Authors: Vincent Dumoulin, Francesco Visin

    Abstract: We introduce a guide to help deep learning practitioners understand and manipulate convolutional neural network architectures. The guide clarifies the relationship between various properties (input shape, kernel shape, zero padding, strides and output shape) of convolutional, pooling and transposed convolutional layers, as well as the relationship between convolutional and transposed convolutional… ▽ More

    Submitted 11 January, 2018; v1 submitted 23 March, 2016; originally announced March 2016.

  28. arXiv:1602.03220  [pdf, other

    stat.ML cs.LG

    Discriminative Regularization for Generative Models

    Authors: Alex Lamb, Vincent Dumoulin, Aaron Courville

    Abstract: We explore the question of whether the representations learned by classifiers can be used to enhance the quality of generative models. Our conjecture is that labels correspond to characteristics of natural data which are most salient to humans: identity in faces, objects in images, and utterances in speech. We propose to take advantage of this by using the representations from discriminative class… ▽ More

    Submitted 15 February, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

  29. arXiv:1506.00619  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Blocks and Fuel: Frameworks for deep learning

    Authors: Bart van Merriënboer, Dzmitry Bahdanau, Vincent Dumoulin, Dmitriy Serdyuk, David Warde-Farley, Jan Chorowski, Yoshua Bengio

    Abstract: We introduce two Python frameworks to train neural networks on large datasets: Blocks and Fuel. Blocks is based on Theano, a linear algebra compiler with CUDA-support. It facilitates the training of complex neural network models by providing parametrized Theano operations, attaching metadata to Theano's symbolic computational graph, and providing an extensive set of utilities to assist training th… ▽ More

    Submitted 1 June, 2015; originally announced June 2015.

  30. arXiv:1312.5258  [pdf, other

    stat.ML cs.LG

    On the Challenges of Physical Implementations of RBMs

    Authors: Vincent Dumoulin, Ian J. Goodfellow, Aaron Courville, Yoshua Bengio

    Abstract: Restricted Boltzmann machines (RBMs) are powerful machine learning models, but learning and some kinds of inference in the model require sampling-based approximations, which, in classical digital computers, are implemented using expensive MCMC. Physical computation offers the opportunity to reduce the cost of sampling by building physical systems whose natural dynamics correspond to drawing sample… ▽ More

    Submitted 24 October, 2014; v1 submitted 18 December, 2013; originally announced December 2013.

    Journal ref: Proc. AAAI 2014, pp. 1199-1205

  31. arXiv:1308.4214  [pdf, ps, other

    stat.ML cs.LG cs.MS

    Pylearn2: a machine learning research library

    Authors: Ian J. Goodfellow, David Warde-Farley, Pascal Lamblin, Vincent Dumoulin, Mehdi Mirza, Razvan Pascanu, James Bergstra, Frédéric Bastien, Yoshua Bengio

    Abstract: Pylearn2 is a machine learning research library. This does not just mean that it is a collection of machine learning algorithms that share a common API; it means that it has been designed for flexibility and extensibility in order to facilitate research projects that involve new or unusual use cases. In this paper we give a brief history of the library, an overview of its basic philosophy, a summa… ▽ More

    Submitted 19 August, 2013; originally announced August 2013.

    Comments: 9 pages