Skip to main content

Showing 1–15 of 15 results for author: Kurach, K

  1. arXiv:1907.11180  [pdf, other

    cs.LG stat.ML

    Google Research Football: A Novel Reinforcement Learning Environment

    Authors: Karol Kurach, Anton Raichuk, Piotr Stańczyk, Michał Zając, Olivier Bachem, Lasse Espeholt, Carlos Riquelme, Damien Vincent, Marcin Michalski, Olivier Bousquet, Sylvain Gelly

    Abstract: Recent progress in the field of reinforcement learning has been accelerated by virtual learning environments such as video games, where novel algorithms and ideas can be quickly tested in a safe and reproducible manner. We introduce the Google Research Football Environment, a new reinforcement learning environment where agents are trained to play football in an advanced, physics-based 3D simulator… ▽ More

    Submitted 14 April, 2020; v1 submitted 25 July, 2019; originally announced July 2019.

  2. arXiv:1812.01717  [pdf, other

    cs.CV cs.AI cs.LG cs.NE stat.ML

    Towards Accurate Generative Models of Video: A New Metric & Challenges

    Authors: Thomas Unterthiner, Sjoerd van Steenkiste, Karol Kurach, Raphael Marinier, Marcin Michalski, Sylvain Gelly

    Abstract: Recent advances in deep generative models have lead to remarkable progress in synthesizing high quality images. Following their successful application in image processing and representation learning, an important next step is to consider videos. Learning generative models of video is a much harder task, requiring a model to capture the temporal dynamics of a scene, in addition to the visual presen… ▽ More

    Submitted 27 March, 2019; v1 submitted 2 December, 2018; originally announced December 2018.

  3. arXiv:1811.07605  [pdf, other

    cs.LG cs.CV stat.ML

    Adversarial Autoencoders for Compact Representations of 3D Point Clouds

    Authors: Maciej Zamorski, Maciej Zięba, Piotr Klukowski, Rafał Nowak, Karol Kurach, Wojciech Stokowiec, Tomasz Trzciński

    Abstract: Deep generative architectures provide a way to model not only images but also complex, 3-dimensional objects, such as point clouds. In this work, we present a novel method to obtain meaningful representations of 3D shapes that can be used for challenging tasks including 3D points generation, reconstruction, compression, and clustering. Contrary to existing methods for 3D point cloud generation tha… ▽ More

    Submitted 1 May, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: 10 pages, 8 figures

  4. Investigating Object Compositionality in Generative Adversarial Networks

    Authors: Sjoerd van Steenkiste, Karol Kurach, Jürgen Schmidhuber, Sylvain Gelly

    Abstract: Deep generative models seek to recover the process with which the observed data was generated. They may be used to synthesize new samples or to subsequently extract representations. Successful approaches in the domain of images are driven by several core inductive biases. However, a bias to account for the compositional way in which humans structure a visual scene in terms of objects has frequentl… ▽ More

    Submitted 24 July, 2020; v1 submitted 17 October, 2018; originally announced October 2018.

    Comments: A preliminary version of this work (arXiv v1) appeared under the title "A Case for Object Compositionality in Deep Generative Models of Images" as a workshop paper at the NeurIPS2018 workshop on "Modeling the Physical World: Perception, Learning, and Control", and at the NeurIPS2018 workshop on "Relational Representation Learning"

    MSC Class: I.2.6 ACM Class: I.2.6

  5. arXiv:1807.04720  [pdf, other

    cs.LG stat.ML

    A Large-Scale Study on Regularization and Normalization in GANs

    Authors: Karol Kurach, Mario Lucic, Xiaohua Zhai, Marcin Michalski, Sylvain Gelly

    Abstract: Generative adversarial networks (GANs) are a class of deep generative models which aim to learn a target distribution in an unsupervised fashion. While they were successfully applied to many problems, training a GAN is a notoriously challenging task and requires a significant number of hyperparameter tuning, neural architecture engineering, and a non-trivial amount of "tricks". The success in many… ▽ More

    Submitted 14 May, 2019; v1 submitted 12 July, 2018; originally announced July 2018.

    Comments: Revision accepted to ICML'19: More focus on regularization and normalization aspects. Added recent references and promising future directions

  6. arXiv:1803.11203  [pdf, other

    cs.LG

    MemGEN: Memory is All You Need

    Authors: Sylvain Gelly, Karol Kurach, Marcin Michalski, Xiaohua Zhai

    Abstract: We propose a new learning paradigm called Deep Memory. It has the potential to completely revolutionize the Machine Learning field. Surprisingly, this paradigm has not been reinvented yet, unlike Deep Learning. At the core of this approach is the \textit{Learning By Heart} principle, well studied in primary schools all over the world. Inspired by poem recitation, or by $π$ decimal memorization,… ▽ More

    Submitted 29 March, 2018; originally announced March 2018.

  7. arXiv:1711.10337  [pdf, other

    stat.ML cs.LG

    Are GANs Created Equal? A Large-Scale Study

    Authors: Mario Lucic, Karol Kurach, Marcin Michalski, Sylvain Gelly, Olivier Bousquet

    Abstract: Generative adversarial networks (GAN) are a powerful subclass of generative models. Despite a very rich research activity leading to numerous interesting GAN algorithms, it is still very hard to assess which algorithm(s) perform better than others. We conduct a neutral, multi-faceted large-scale empirical study on state-of-the art models and evaluation measures. We find that most models can reach… ▽ More

    Submitted 29 October, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: NIPS'18: Added a section on the limitations of the study and additional empirical results

  8. arXiv:1706.03200  [pdf, other

    cs.LG

    Critical Hyper-Parameters: No Random, No Cry

    Authors: Olivier Bousquet, Sylvain Gelly, Karol Kurach, Olivier Teytaud, Damien Vincent

    Abstract: The selection of hyper-parameters is critical in Deep Learning. Because of the long training time of complex models and the availability of compute resources in the cloud, "one-shot" optimization schemes - where the sets of hyper-parameters are selected in advance (e.g. on a grid or in a random manner) and the training is executed in parallel - are commonly used. It is known that grid search is su… ▽ More

    Submitted 10 June, 2017; originally announced June 2017.

  9. arXiv:1706.03199  [pdf, other

    cs.LG

    Toward Optimal Run Racing: Application to Deep Learning Calibration

    Authors: Olivier Bousquet, Sylvain Gelly, Karol Kurach, Marc Schoenauer, Michele Sebag, Olivier Teytaud, Damien Vincent

    Abstract: This paper aims at one-shot learning of deep neural nets, where a highly parallel setting is considered to address the algorithm calibration problem - selecting the best neural architecture and learning hyper-parameter values depending on the dataset at hand. The notoriously expensive calibration problem is optimally reduced by detecting and early stopping non-optimal runs. The theoretical contrib… ▽ More

    Submitted 20 June, 2017; v1 submitted 10 June, 2017; originally announced June 2017.

  10. arXiv:1705.08386  [pdf, other

    cs.CL cs.CV cs.LG

    Better Text Understanding Through Image-To-Text Transfer

    Authors: Karol Kurach, Sylvain Gelly, Michal Jastrzebski, Philip Haeusser, Olivier Teytaud, Damien Vincent, Olivier Bousquet

    Abstract: Generic text embeddings are successfully used in a variety of tasks. However, they are often learnt by capturing the co-occurrence structure from pure text corpora, resulting in limitations of their ability to generalize. In this paper, we explore models that incorporate visual information into the text representation. Based on comprehensive ablation studies, we propose a conceptually simple, yet… ▽ More

    Submitted 26 May, 2017; v1 submitted 23 May, 2017; originally announced May 2017.

  11. arXiv:1606.04870  [pdf, other

    cs.CL

    Smart Reply: Automated Response Suggestion for Email

    Authors: Anjuli Kannan, Karol Kurach, Sujith Ravi, Tobias Kaufmann, Andrew Tomkins, Balint Miklos, Greg Corrado, Laszlo Lukacs, Marina Ganea, Peter Young, Vivek Ramavajjala

    Abstract: In this paper we propose and investigate a novel end-to-end method for automatically generating short email responses, called Smart Reply. It generates semantically diverse suggestions that can be used as complete email responses with just one tap on mobile. The system is currently used in Inbox by Gmail and is responsible for assisting with 10% of all mobile responses. It is designed to work at v… ▽ More

    Submitted 15 June, 2016; originally announced June 2016.

    Comments: Accepted to KDD 2016

  12. arXiv:1602.03218  [pdf, ps, other

    cs.LG

    Learning Efficient Algorithms with Hierarchical Attentive Memory

    Authors: Marcin Andrychowicz, Karol Kurach

    Abstract: In this paper, we propose and investigate a novel memory architecture for neural networks called Hierarchical Attentive Memory (HAM). It is based on a binary tree with leaves corresponding to memory cells. This allows HAM to perform memory access in O(log n) complexity, which is a significant improvement over the standard attention mechanism that requires O(n) operations, where n is the size of th… ▽ More

    Submitted 23 February, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

    Comments: Added soft attention appendix

  13. arXiv:1511.06807  [pdf, other

    stat.ML cs.LG

    Adding Gradient Noise Improves Learning for Very Deep Networks

    Authors: Arvind Neelakantan, Luke Vilnis, Quoc V. Le, Ilya Sutskever, Lukasz Kaiser, Karol Kurach, James Martens

    Abstract: Deep feedforward and recurrent networks have achieved impressive results in many perception and language processing applications. This success is partially attributed to architectural innovations such as convolutional and long short-term memory networks. The main motivation for these architectural innovations is that they capture better domain knowledge, and importantly are easier to optimize than… ▽ More

    Submitted 20 November, 2015; originally announced November 2015.

  14. arXiv:1511.06392  [pdf, other

    cs.LG cs.NE

    Neural Random-Access Machines

    Authors: Karol Kurach, Marcin Andrychowicz, Ilya Sutskever

    Abstract: In this paper, we propose and investigate a new neural network architecture called Neural Random Access Machine. It can manipulate and dereference pointers to an external variable-size random-access memory. The model is trained from pure input-output examples using backpropagation. We evaluate the new model on a number of simple algorithmic tasks whose solutions require pointer manipulation and… ▽ More

    Submitted 9 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: ICLR submission, 17 pages, 9 figures, 6 tables (with bibliography and appendix)

  15. arXiv:1406.1584  [pdf, other

    cs.LG

    Learning to Discover Efficient Mathematical Identities

    Authors: Wojciech Zaremba, Karol Kurach, Rob Fergus

    Abstract: In this paper we explore how machine learning techniques can be applied to the discovery of efficient mathematical identities. We introduce an attribute grammar framework for representing symbolic expressions. Given a set of grammar rules we build trees that combine different rules, looking for branches which yield compositions that are analytically equivalent to a target expression, but of lower… ▽ More

    Submitted 5 November, 2014; v1 submitted 6 June, 2014; originally announced June 2014.