Skip to main content

Showing 1–43 of 43 results for author: Hare, J

  1. arXiv:2405.01693  [pdf, other

    cs.CR

    Adversarial Attacks on Reinforcement Learning Agents for Command and Control

    Authors: Ahaan Dabholkar, James Z. Hare, Mark Mittrick, John Richardson, Nicholas Waytowich, Priya Narayanan, Saurabh Bagchi

    Abstract: Given the recent impact of Deep Reinforcement Learning in training agents to win complex games like StarCraft and DoTA(Defense Of The Ancients) - there has been a surge in research for exploiting learning based techniques for professional wargaming, battlefield simulation and modeling. Real time strategy games and simulators have become a valuable resource for operational planning and military res… ▽ More

    Submitted 1 July, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted to appear in the Journal Of Defense Modeling and Simulation (JDMS)

  2. arXiv:2401.08965  [pdf, other

    cs.CV

    Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded Devices

    Authors: Lei Xun, Jonathon Hare, Geoff V. Merrett

    Abstract: Deep neural network (DNN) inference is increasingly being executed on mobile and embedded platforms due to several key advantages in latency, privacy and always-on availability. However, due to limited computing resources, efficient DNN deployment on mobile and embedded platforms is challenging. Although many hardware accelerators and static model compression methods were proposed by previous work… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at Design, Automation & Test in Europe Conference (DATE) 2024, PhD Forum

  3. arXiv:2401.08943  [pdf, other

    cs.CV

    Fluid Dynamic DNNs for Reliable and Adaptive Distributed Inference on Edge Devices

    Authors: Lei Xun, Mingyu Hu, Hengrui Zhao, Amit Kumar Singh, Jonathon Hare, Geoff V. Merrett

    Abstract: Distributed inference is a popular approach for efficient DNN inference at the edge. However, traditional Static and Dynamic DNNs are not distribution-friendly, causing system reliability and adaptability issues. In this paper, we introduce Fluid Dynamic DNNs (Fluid DyDNNs), tailored for distributed inference. Distinct from Static and Dynamic DNNs, Fluid DyDNNs utilize a novel nested incremental t… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted at Design, Automation & Test in Europe Conference (DATE) 2024

  4. arXiv:2401.04290  [pdf, other

    cs.CV cs.AI cs.MA

    StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments

    Authors: Sean Kulinski, Nicholas R. Waytowich, James Z. Hare, David I. Inouye

    Abstract: Spatial reasoning tasks in multi-agent environments such as event prediction, agent type identification, or missing data imputation are important for multiple applications (e.g., autonomous surveillance over sensor networks and subtasks for reinforcement learning (RL)). StarCraft II game replays encode intelligent (and adversarial) multi-agent behavior and could provide a testbed for these tasks;… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Published in CVPR 23'

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023

  5. arXiv:2311.04740  [pdf, other

    cs.MA cs.LG cs.RO

    Enhancing Multi-Agent Coordination through Common Operating Picture Integration

    Authors: Peihong Yu, Bhoram Lee, Aswin Raghavan, Supun Samarasekara, Pratap Tokekar, James Zachary Hare

    Abstract: In multi-agent systems, agents possess only local observations of the environment. Communication between teammates becomes crucial for enhancing coordination. Past research has primarily focused on encoding local information into embedding messages which are unintelligible to humans. We find that using these messages in agent's policy learning leads to brittle policies when tested on out-of-distri… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: accepted to OODWorkshop@CoRL23; please see https://openreview.net/forum?id=fADcJl0B0P for the paper

  6. arXiv:2211.05624  [pdf, other

    cs.LG cs.NE

    Improving the Robustness of Neural Multiplication Units with Reversible Stochasticity

    Authors: Bhumika Mistry, Katayoun Farrahi, Jonathon Hare

    Abstract: Multilayer Perceptrons struggle to learn certain simple arithmetic tasks. Specialist neural modules for arithmetic can outperform classical architectures with gains in extrapolation, interpretability and convergence speeds, but are highly sensitive to the training range. In this paper, we show that Neural Multiplication Units (NMUs) are unable to reliably learn tasks as simple as multiplying two i… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 26 pages (10 page main body)

  7. arXiv:2206.02525  [pdf, other

    cs.AR

    Dynamic DNNs Meet Runtime Resource Management on Mobile and Embedded Platforms

    Authors: Lei Xun, Bashir M. Al-Hashimi, Jonathon Hare, Geoff V. Merrett

    Abstract: Deep neural network (DNN) inference is increasingly being executed on mobile and embedded platforms due to low latency and better privacy. However, efficient deployment on these platforms is challenging due to the intensive computation and memory access. We propose a holistic system design for DNN performance and energy optimisation, combining the trade-off opportunities in both algorithms and har… ▽ More

    Submitted 6 June, 2022; v1 submitted 17 May, 2022; originally announced June 2022.

    Comments: Accepted as a presentation at Fourth UK Mobile, Wearable and Ubiquitous Systems Research Symposium (MobiUK 2022)

  8. arXiv:2205.05784  [pdf, other

    cs.LG cs.MA

    Learning to Guide Multiple Heterogeneous Actors from a Single Human Demonstration via Automatic Curriculum Learning in StarCraft II

    Authors: Nicholas Waytowich, James Hare, Vinicius G. Goecks, Mark Mittrick, John Richardson, Anjon Basak, Derrik E. Asher

    Abstract: Traditionally, learning from human demonstrations via direct behavior cloning can lead to high-performance policies given that the algorithm has access to large amounts of high-quality data covering the most likely scenarios to be encountered when the agent is operating. However, in real-world scenarios, expert data is limited and it is desired to train an agent that learns a behavior policy gener… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Submitted to the 2022 SPIE Defense + Commercial Sensing (DCS) Conference on "Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications IV"

    ACM Class: I.2.6; I.2.11

  9. arXiv:2202.07052  [pdf, other

    cs.LG

    Orthogonalising gradients to speed up neural network optimisation

    Authors: Mark Tuddenham, Adam Prügel-Bennett, Jonathan Hare

    Abstract: The optimisation of neural networks can be sped up by orthogonalising the gradients before the optimisation step, ensuring the diversification of the learned representations. We orthogonalise the gradients of the layer's components/filters with respect to each other to separate out the intermediate representations. Our method of orthogonalisation allows the weights to be used more flexibly, in con… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  10. arXiv:2201.08142  [pdf, other

    cs.RO cs.CV

    Physically Embodied Deep Image Optimisation

    Authors: Daniela Mihai, Jonathon Hare

    Abstract: Physical sketches are created by learning programs to control a drawing robot. A differentiable rasteriser is used to optimise sets of drawing strokes to match an input image, using deep networks to provide an encoding for which we can compute a loss. The optimised drawing primitives can then be translated into G-code commands which command a robot to draw the image using drawing instruments such… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Journal ref: 5th Workshop on Machine Learning for Creativity and Design of the Neural Information Processing Systems (NeurIPS) 2021 Conference

  11. arXiv:2110.08203  [pdf, other

    cs.LG cs.AI cs.CV

    Shared Visual Representations of Drawing for Communication: How do different biases affect human interpretability and intent?

    Authors: Daniela Mihai, Jonathon Hare

    Abstract: We present an investigation into how representational losses can affect the drawings produced by artificial agents playing a communication game. Building upon recent advances, we show that a combination of powerful pretrained encoder networks, with appropriate inductive biases, can lead to agents that draw recognisable sketches, whilst still communicating well. Further, we start to develop an appr… ▽ More

    Submitted 20 January, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Journal ref: 3rd Workshop on Shared Visual Representations in Human and Machine Intelligence (SVRHM 2021) of the Neural Information Processing Systems (NeurIPS) conference

  12. arXiv:2110.05177  [pdf, other

    cs.NE cs.LG stat.ML

    Learning Division with Neural Arithmetic Logic Modules

    Authors: Bhumika Mistry, Katayoun Farrahi, Jonathon Hare

    Abstract: To achieve systematic generalisation, it first makes sense to master simple tasks such as arithmetic. Of the four fundamental arithmetic operations (+,-,$\times$,$÷$), division is considered the most difficult for both humans and computers. In this paper we show that robustly learning division in a systematic manner remains a challenge even at the simplest level of dividing two numbers. We propose… ▽ More

    Submitted 12 October, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 28 pages, 24 figures. New experiments included(Section 7 and Appendix G)

  13. arXiv:2109.09495  [pdf, other

    cs.LG cs.NE

    GhostShiftAddNet: More Features from Energy-Efficient Operations

    Authors: Jia Bi, Jonathon Hare, Geoff V. Merrett

    Abstract: Deep convolutional neural networks (CNNs) are computationally and memory intensive. In CNNs, intensive multiplication can have resource implications that may challenge the ability for effective deployment of inference on resource-constrained edge devices. This paper proposes GhostShiftAddNet, where the motivation is to implement a hardware-efficient deep network: a multiplication-free CNN with few… ▽ More

    Submitted 3 February, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Journal ref: The 32nd British Machine Vision Conference BMVC 2021

  14. arXiv:2107.12021  [pdf, other

    cs.CV

    Language Models as Zero-shot Visual Semantic Learners

    Authors: Yue Jiao, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Visual Semantic Embedding (VSE) models, which map images into a rich semantic embedding space, have been a milestone in object recognition and zero-shot learning. Current approaches to VSE heavily rely on static word em-bedding techniques. In this work, we propose a Visual Se-mantic Embedding Probe (VSEP) designed to probe the semantic information of contextualized word embeddings in visual semant… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  15. arXiv:2107.11991  [pdf, other

    cs.CV

    What Remains of Visual Semantic Embeddings

    Authors: Yue Jiao, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Zero shot learning (ZSL) has seen a surge in interest over the decade for its tight links with the mechanism making young children recognize novel objects. Although different paradigms of visual semantic embedding models are designed to align visual features and distributed word representations, it is unclear to what extent current ZSL models encode semantic information from distributed word repre… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  16. arXiv:2107.08199  [pdf, other

    cs.CL cs.LG

    Dynamic Transformer for Efficient Machine Translation on Embedded Devices

    Authors: Hishan Parry, Lei Xun, Amin Sabet, Jia Bi, Jonathon Hare, Geoff V. Merrett

    Abstract: The Transformer architecture is widely used for machine translation tasks. However, its resource-intensive nature makes it challenging to implement on constrained embedded devices, particularly where available hardware resources can vary at run-time. We propose a dynamic machine translation model that scales the Transformer architecture based on the available resources at any particular time. The… ▽ More

    Submitted 30 July, 2021; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: Accepted at MLCAD 2021

  17. arXiv:2106.11208  [pdf, other

    cs.CV

    Temporal Early Exits for Efficient Video Object Detection

    Authors: Amin Sabet, Jonathon Hare, Bashir Al-Hashimi, Geoff V. Merrett

    Abstract: Transferring image-based object detectors to the domain of video remains challenging under resource constraints. Previous efforts utilised optical flow to allow unchanged features to be propagated, however, the overhead is considerable when working with very slowly changing scenes from applications such as surveillance. In this paper, we propose temporal early exits to reduce the computational com… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  18. arXiv:2106.02067  [pdf, other

    cs.CV cs.AI cs.LG cs.MA

    Learning to Draw: Emergent Communication through Sketching

    Authors: Daniela Mihai, Jonathon Hare

    Abstract: Evidence that visual communication preceded written language and provided a basis for it goes back to prehistory, in forms such as cave and rock paintings depicting traces of our distant ancestors. Emergent communication research has sought to explore how agents can learn to communicate in order to collaboratively solve tasks. Existing research has focused on language, with a learned communication… ▽ More

    Submitted 9 November, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

  19. arXiv:2105.03596  [pdf, other

    cs.CV

    Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms

    Authors: Wei Lou, Lei Xun, Amin Sabet, Jia Bi, Jonathon Hare, Geoff V. Merrett

    Abstract: Mobile and embedded platforms are increasingly required to efficiently execute computationally demanding DNNs across heterogeneous processing elements. At runtime, the available hardware resources to DNNs can vary considerably due to other concurrently running applications. The performance requirements of the applications could also change under different scenarios. To achieve the desired performa… ▽ More

    Submitted 11 May, 2021; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted at CVPR ECV Workshop 2021

  20. arXiv:2103.16194  [pdf, other

    cs.CV

    Differentiable Drawing and Sketching

    Authors: Daniela Mihai, Jonathon Hare

    Abstract: We present a bottom-up differentiable relaxation of the process of drawing points, lines and curves into a pixel raster. Our approach arises from the observation that rasterising a pixel in an image given parameters of a primitive can be reformulated in terms of the primitive's distance transform, and then relaxed to allow the primitive's parameters to be learned. This relaxation allows end-to-end… ▽ More

    Submitted 19 July, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

  21. arXiv:2101.11134  [pdf, other

    cs.CL cs.LG

    Open-domain Topic Identification of Out-of-domain Utterances using Wikipedia

    Authors: A. Augustin, A. Papangelis, M. Kotti, P. Vougiouklis, J. Hare, N. Braunschweiler

    Abstract: Users of spoken dialogue systems (SDS) expect high quality interactions across a wide range of diverse topics. However, the implementation of SDS capable of responding to every conceivable user utterance in an informative way is a challenging problem. Multi-domain SDS must necessarily identify and deal with out-of-domain (OOD) utterances to generate appropriate responses as users do not always kno… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

  22. arXiv:2101.10253  [pdf, other

    cs.CV cs.CL cs.LG

    The emergence of visual semantics through communication games

    Authors: Daniela Mihai, Jonathon Hare

    Abstract: The emergence of communication systems between agents which learn to play referential signalling games with realistic images has attracted a lot of attention recently. The majority of work has focused on using fixed, pretrained image feature extraction networks which potentially bias the information the agents learn to communicate. In this work, we consider a signalling game setting in which a `se… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: arXiv admin note: text overlap with arXiv:1911.05546

  23. arXiv:2101.09530  [pdf, other

    cs.NE

    A Primer for Neural Arithmetic Logic Modules

    Authors: Bhumika Mistry, Katayoun Farrahi, Jonathon Hare

    Abstract: Neural Arithmetic Logic Modules have become a growing area of interest, though remain a niche field. These modules are neural networks which aim to achieve systematic generalisation in learning arithmetic and/or logic operations such as $\{+, -, \times, ÷, \leq, \textrm{AND}\}$ while also being interpretable. This paper is the first in discussing the current state of progress of this field, explai… ▽ More

    Submitted 8 August, 2022; v1 submitted 23 January, 2021; originally announced January 2021.

    Comments: JMLR Accepted Version, 58 pages

    Journal ref: Journal of Machine Learning Research 23 (2022) 1-58

  24. arXiv:2012.01938  [pdf, other

    cs.LG

    Quasi-Newton's method in the class gradient defined high-curvature subspace

    Authors: Mark Tuddenham, Adam Prügel-Bennett, Jonathan Hare

    Abstract: Classification problems using deep learning have been shown to have a high-curvature subspace in the loss landscape equal in dimension to the number of classes. Moreover, this subspace corresponds to the subspace spanned by the logit gradients for each class. An obvious strategy to speed up optimisation would be to use Newton's method in the high-curvature subspace and stochastic gradient descent… ▽ More

    Submitted 28 November, 2020; originally announced December 2020.

    Journal ref: OPT2020: 12th Annual Workshop on Optimization for Machine Learning

  25. arXiv:2011.10669  [pdf, other

    cs.AI cs.MA cs.SI

    A General Framework for Distributed Inference with Uncertain Models

    Authors: James Z. Hare, Cesar A. Uribe, Lance Kaplan, Ali Jadbabaie

    Abstract: This paper studies the problem of distributed classification with a network of heterogeneous agents. The agents seek to jointly identify the underlying target class that best describes a sequence of observations. The problem is first abstracted to a hypothesis-testing framework, where we assume that the agents seek to agree on the hypothesis (target class) that best matches the distribution of obs… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  26. arXiv:2010.02634  [pdf, other

    cs.CV cs.LG cs.NE q-bio.NC

    How Convolutional Neural Network Architecture Biases Learned Opponency and Colour Tuning

    Authors: Ethan Harris, Daniela Mihai, Jonathon Hare

    Abstract: Recent work suggests that changing Convolutional Neural Network (CNN) architecture by introducing a bottleneck in the second layer can yield changes in learned function. To understand this relationship fully requires a way of quantitatively comparing trained networks. The fields of electrophysiology and psychophysics have developed a wealth of methods for characterising visual systems which permit… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Final version; Accepted for publication in Neural Computation

  27. arXiv:2008.07922  [pdf, other

    cs.LG stat.ML

    Linear Disentangled Representations and Unsupervised Action Estimation

    Authors: Matthew Painter, Jonathon Hare, Adam Prugel-Bennett

    Abstract: Disentangled representation learning has seen a surge in interest over recent times, generally focusing on new models which optimise one of many disparate disentanglement metrics. Symmetry Based Disentangled Representation learning introduced a robust mathematical framework that defined precisely what is meant by a "linear disentangled representation". This framework determined that such represent… ▽ More

    Submitted 15 December, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  28. arXiv:2002.12047  [pdf, other

    cs.LG cs.CV cs.IT stat.ML

    FMix: Enhancing Mixed Sample Data Augmentation

    Authors: Ethan Harris, Antonia Marcu, Matthew Painter, Mahesan Niranjan, Adam Prügel-Bennett, Jonathon Hare

    Abstract: Mixed Sample Data Augmentation (MSDA) has received increasing attention in recent years, with many successful variants such as MixUp and CutMix. By studying the mutual information between the function learned by a VAE on the original data and on the augmented data we show that MixUp distorts learned functions in a way that CutMix does not. We further demonstrate this by showing that MixUp acts as… ▽ More

    Submitted 28 February, 2021; v1 submitted 27 February, 2020; originally announced February 2020.

    Comments: Code available at https://github.com/ecs-vlc/FMix

  29. arXiv:1911.05546  [pdf, other

    cs.LG cs.CV stat.ML

    Avoiding hashing and encouraging visual semantics in referential emergent language games

    Authors: Daniela Mihai, Jonathon Hare

    Abstract: There has been an increasing interest in the area of emergent communication between agents which learn to play referential signalling games with realistic images. In this work, we consider the signalling game setting of Havrylov and Titov and investigate the effect of the feature extractor's weights and of the task being solved on the visual semantics learned or captured by the models. We impose v… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: 4 pages, presented at Emergent Communication: Towards Natural Language workshop (NeurIPS 2019)

  30. arXiv:1910.11251  [pdf, other

    stat.ME cs.SI math.OC

    Non-Bayesian Social Learning with Gaussian Uncertain Models

    Authors: James Z. Hare, Cesar Uribe, Lance Kaplan, Ali Jadbabaie

    Abstract: Non-Bayesian social learning theory provides a framework for distributed inference of a group of agents interacting over a social network by sequentially communicating and updating beliefs about the unknown state of the world through likelihood updates from their observations. Typically, likelihood models are assumed known precisely. However, in many situations the models are generated from sparse… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

  31. arXiv:1910.11086  [pdf, other

    cs.CV cs.LG stat.ML

    Spatial and Colour Opponency in Anatomically Constrained Deep Networks

    Authors: Ethan Harris, Daniela Mihai, Jonathon Hare

    Abstract: Colour vision has long fascinated scientists, who have sought to understand both the physiology of the mechanics of colour vision and the psychophysics of colour perception. We consider representations of colour in anatomically constrained convolutional deep neural networks. Following ideas from neuroscience, we classify cells in early layers into groups relating to their spectral and spatial func… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

  32. arXiv:1910.10795  [pdf, other

    eess.SY cs.MA

    POSE.R: Prediction-based Opportunistic Sensing for Resilient and Efficient Sensor Networks

    Authors: James Z. Hare, Junnan Song, Shalabh Gupta, Thomas A. Wettergren

    Abstract: The paper presents a distributed algorithm, called Prediction-based Opportunistic Sensing for Resilient and Efficient Sensor Networks (POSE.R), where the sensor nodes utilize predictions of the targets positions to probabilistically control their multi-modal operating states to track the target. There are two desired features of the algorithm: energy-efficiency and resilience. If the target is tra… ▽ More

    Submitted 2 September, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Journal ref: ACM Transactions on Sensor Networks, Vol. 17, Issue 1, Article 5, pp. 1-41, 2020

  33. arXiv:1910.09281  [pdf, other

    cs.LG cs.AI stat.ML

    Dealing with Sparse Rewards in Reinforcement Learning

    Authors: Joshua Hare

    Abstract: Successfully navigating a complex environment to obtain a desired outcome is a difficult task, that up to recently was believed to be capable only by humans. This perception has been broken down over time, especially with the introduction of deep reinforcement learning, which has greatly increased the difficulty of tasks that can be automated. However, for traditional reinforcement learning agents… ▽ More

    Submitted 11 November, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

  34. arXiv:1909.09228  [pdf, other

    cs.AI cs.MA cs.SI

    Non-Bayesian Social Learning with Uncertain Models

    Authors: James Z. Hare, Cesar A. Uribe, Lance Kaplan, Ali Jadbabaie

    Abstract: Non-Bayesian social learning theory provides a framework that models distributed inference for a group of agents interacting over a social network. In this framework, each agent iteratively forms and communicates beliefs about an unknown state of the world with their neighbors using a learning rule. Existing approaches assume agents have access to precise statistical models (in the form of likelih… ▽ More

    Submitted 27 September, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

  35. arXiv:1909.04255  [pdf, other

    math.OC cs.MA cs.SI

    Non-Bayesian Social Learning with Uncertain Models over Time-Varying Directed Graphs

    Authors: César A. Uribe, James Z. Hare, Lance Kaplan, Ali Jadbabaie

    Abstract: We study the problem of non-Bayesian social learning with uncertain models, in which a network of agents seek to cooperatively identify the state of the world based on a sequence of observed signals. In contrast with the existing literature, we focus our attention on the scenario where the statistical models held by the agents about possible states of the world are built from finite observations.… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: To appear at CDC2019

  36. arXiv:1906.06565  [pdf, other

    cs.LG stat.ML

    Deep Set Prediction Networks

    Authors: Yan Zhang, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Current approaches for predicting sets from feature vectors ignore the unordered nature of sets and suffer from discontinuity issues as a result. We propose a general model for predicting sets that properly respects the structure of sets and avoids this problem. With a single feature vector as input, we show that our model is able to auto-encode point sets, predict the set of bounding boxes of obj… ▽ More

    Submitted 24 April, 2020; v1 submitted 15 June, 2019; originally announced June 2019.

    Comments: Appendix C contains an erratum

    Journal ref: Advances in Neural Information Processing Systems 32 (NeurIPS 2019)

  37. arXiv:1906.02795  [pdf, other

    cs.LG cs.AI stat.ML

    FSPool: Learning Set Representations with Featurewise Sort Pooling

    Authors: Yan Zhang, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Traditional set prediction models can struggle with simple datasets due to an issue we call the responsibility problem. We introduce a pooling method for sets of feature vectors based on sorting features across elements of the set. This can be used to construct a permutation-equivariant auto-encoder that avoids this responsibility problem. On a toy dataset of polygons and a set version of MNIST, w… ▽ More

    Submitted 1 May, 2020; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: Published at International Conference on Learning Representations (ICLR) 2020

  38. arXiv:1901.03665  [pdf, other

    cs.CV cs.LG stat.ML

    A Biologically Inspired Visual Working Memory for Deep Networks

    Authors: Ethan Harris, Mahesan Niranjan, Jonathon Hare

    Abstract: The ability to look multiple times through a series of pose-adjusted glimpses is fundamental to human vision. This critical faculty allows us to understand highly complex visual scenes. Short term memory plays an integral role in aggregating the information obtained from these glimpses and informing our interpretation of the scene. Computational models have attempted to address glimpsing and visua… ▽ More

    Submitted 9 January, 2019; originally announced January 2019.

  39. arXiv:1812.03928  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Representations of Sets through Optimized Permutations

    Authors: Yan Zhang, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Representations of sets are challenging to learn because operations on sets should be permutation-invariant. To this end, we propose a Permutation-Optimisation module that learns how to permute a set end-to-end. The permuted set can be further processed to learn a permutation-invariant representation of that set, avoiding a bottleneck in traditional set models. We demonstrate our model's ability t… ▽ More

    Submitted 14 January, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: Published in ICLR 2019

  40. arXiv:1809.03363  [pdf, ps, other

    cs.LG cs.AI cs.CV stat.ML

    Torchbearer: A Model Fitting Library for PyTorch

    Authors: Ethan Harris, Matthew Painter, Jonathon Hare

    Abstract: We introduce torchbearer, a model fitting library for pytorch aimed at researchers working on deep learning or differentiable programming. The torchbearer library provides a high level metric and callback API that can be used for a wide range of applications. We also include a series of built in callbacks that can be used for: model persistence, learning rate decay, logging, data visualization and… ▽ More

    Submitted 10 September, 2018; originally announced September 2018.

    Comments: 5 pages

  41. arXiv:1803.07116  [pdf, other

    cs.CL

    Learning to Generate Wikipedia Summaries for Underserved Languages from Wikidata

    Authors: Lucie-Aimée Kaffee, Hady Elsahar, Pavlos Vougiouklis, Christophe Gravier, Frédérique Laforest, Jonathon Hare, Elena Simperl

    Abstract: While Wikipedia exists in 287 languages, its content is unevenly distributed among them. In this work, we investigate the generation of open domain Wikipedia summaries in underserved languages using structured data from Wikidata. To this end, we propose a neural network architecture equipped with copy actions that learns to generate single-sentence and comprehensible textual summaries from Wikidat… ▽ More

    Submitted 29 April, 2018; v1 submitted 19 March, 2018; originally announced March 2018.

    Comments: NAACL HTL 2018

  42. arXiv:1802.05766  [pdf, other

    cs.CV cs.CL

    Learning to Count Objects in Natural Images for Visual Question Answering

    Authors: Yan Zhang, Jonathon Hare, Adam Prügel-Bennett

    Abstract: Visual Question Answering (VQA) models have struggled with counting objects in natural images so far. We identify a fundamental problem due to soft attention in these models as a cause. To circumvent this problem, we propose a neural network component that allows robust counting from object proposals. Experiments on a toy task show the effectiveness of this component and we obtain state-of-the-art… ▽ More

    Submitted 15 February, 2018; originally announced February 2018.

    Comments: Published in ICLR 2018

  43. Neural Wikipedian: Generating Textual Summaries from Knowledge Base Triples

    Authors: Pavlos Vougiouklis, Hady Elsahar, Lucie-Aimée Kaffee, Christoph Gravier, Frederique Laforest, Jonathon Hare, Elena Simperl

    Abstract: Most people do not interact with Semantic Web data directly. Unless they have the expertise to understand the underlying technology, they need textual or visual interfaces to help them make sense of it. We explore the problem of generating natural language summaries for Semantic Web data. This is non-trivial, especially in an open-domain context. To address this problem, we explore the use of neur… ▽ More

    Submitted 31 October, 2017; originally announced November 2017.