Skip to main content

Showing 1–49 of 49 results for author: Hill, F

  1. arXiv:2404.10179  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi , et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  2. arXiv:2404.07129  [pdf, other

    cs.LG

    What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

    Authors: Aaditya K. Singh, Ted Moskovitz, Felix Hill, Stephanie C. Y. Chan, Andrew M. Saxe

    Abstract: In-context learning is a powerful emergent ability in transformer models. Prior work in mechanistic interpretability has identified a circuit element that may be critical for in-context learning -- the induction head (IH), which performs a match-and-copy operation. During training of large transformers on natural language data, IHs emerge around the same time as a notable phase change in the loss.… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 26 pages, 18 figures

  3. arXiv:2311.17901  [pdf, other

    cs.CV cs.AI cs.LG

    SODA: Bottleneck Diffusion Models for Representation Learning

    Authors: Drew A. Hudson, Daniel Zoran, Mateusz Malinowski, Andrew K. Lampinen, Andrew Jaegle, James L. McClelland, Loic Matthey, Felix Hill, Alexander Lerchner

    Abstract: We introduce SODA, a self-supervised diffusion model, designed for representation learning. The model incorporates an image encoder, which distills a source view into a compact representation, that, in turn, guides the generation of related novel views. We show that by imposing a tight bottleneck between the encoder and a denoising decoder, and leveraging novel view synthesis as a self-supervised… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  4. arXiv:2311.08360  [pdf, other

    cs.LG cs.AI cs.CL

    The Transient Nature of Emergent In-Context Learning in Transformers

    Authors: Aaditya K. Singh, Stephanie C. Y. Chan, Ted Moskovitz, Erin Grant, Andrew M. Saxe, Felix Hill

    Abstract: Transformer neural networks can exhibit a surprising capacity for in-context learning (ICL) despite not being explicitly trained for it. Prior work has provided a deeper understanding of how ICL emerges in transformers, e.g. through the lens of mechanistic interpretability, Bayesian inference, or by examining the distributional properties of training data. However, in each of these cases, ICL is t… ▽ More

    Submitted 11 December, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 19 pages, 16 figures

  5. arXiv:2303.07280  [pdf, other

    cs.CV cs.AI cs.LG

    Vision-Language Models as Success Detectors

    Authors: Yuqing Du, Ksenia Konyushkova, Misha Denil, Akhil Raju, Jessica Landon, Felix Hill, Nando de Freitas, Serkan Cabi

    Abstract: Detecting successful behaviour is crucial for training intelligent agents. As such, generalisable reward models are a prerequisite for agents that can learn to generalise their behaviour. In this work we focus on developing robust success detectors that leverage large, pretrained vision-language models (Flamingo, Alayrac et al. (2022)) and human reward annotations. Concretely, we treat success det… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

  6. arXiv:2302.04817  [pdf, other

    cs.LG

    The Edge of Orthogonality: A Simple View of What Makes BYOL Tick

    Authors: Pierre H. Richemond, Allison Tam, Yunhao Tang, Florian Strub, Bilal Piot, Felix Hill

    Abstract: Self-predictive unsupervised learning methods such as BYOL or SimSiam have shown impressive results, and counter-intuitively, do not collapse to trivial representations. In this work, we aim at exploring the simplest possible mathematical arguments towards explaining the underlying mechanisms behind self-predictive unsupervised learning. We start with the observation that those methods crucially r… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  7. arXiv:2302.00763  [pdf, other

    cs.LG cs.AI cs.CL

    Collaborating with language models for embodied reasoning

    Authors: Ishita Dasgupta, Christine Kaeser-Chen, Kenneth Marino, Arun Ahuja, Sheila Babayan, Felix Hill, Rob Fergus

    Abstract: Reasoning in a complex and ambiguous environment is a key goal for Reinforcement Learning (RL) agents. While some sophisticated RL agents can successfully solve difficult tasks, they require a large amount of training data and often struggle to generalize to new unseen environments and new tasks. On the other hand, Large Scale Language Models (LSLMs) have exhibited strong reasoning ability and the… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: Presented at NeurIPS 2022 Language and Reinforcement Learning Workshop (best paper) and NeurIPS 2022 Foundation Models for Decision Making Workshop. 4 pages main; 14 pages total (including references and appendix); 3 figures

  8. arXiv:2301.05158  [pdf, other

    cs.CV cs.AI cs.LG

    SemPPL: Predicting pseudo-labels for better contrastive representations

    Authors: Matko Bošnjak, Pierre H. Richemond, Nenad Tomasev, Florian Strub, Jacob C. Walker, Felix Hill, Lars Holger Buesing, Razvan Pascanu, Charles Blundell, Jovana Mitrovic

    Abstract: Learning from large amounts of unsupervised data and a small amount of supervision is an important open problem in computer vision. We propose a new semi-supervised learning method, Semantic Positives via Pseudo-Labels (SemPPL), that combines labelled and unlabelled data to learn informative representations. Our method extends self-supervised contrastive learning -- where representations are shape… ▽ More

    Submitted 10 January, 2024; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Published as a conference paper at ICLR 2023. For checkpoints and source code see https://github.com/google-deepmind/semppl

  9. arXiv:2210.05675  [pdf, other

    cs.CL cs.AI cs.LG

    Transformers generalize differently from information stored in context vs in weights

    Authors: Stephanie C. Y. Chan, Ishita Dasgupta, Junkyung Kim, Dharshan Kumaran, Andrew K. Lampinen, Felix Hill

    Abstract: Transformer models can use two fundamentally different kinds of information: information stored in weights during training, and information provided ``in-context'' at inference time. In this work, we show that transformers exhibit different inductive biases in how they represent and generalize from the information in these two sources. In particular, we characterize whether they generalize via par… ▽ More

    Submitted 13 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

  10. arXiv:2208.02957  [pdf, other

    cs.CL cs.AI

    Meaning without reference in large language models

    Authors: Steven T. Piantadosi, Felix Hill

    Abstract: The widespread success of large language models (LLMs) has been met with skepticism that they possess anything like human concepts or meanings. Contrary to claims that LLMs possess no meaning whatsoever, we argue that they likely capture important aspects of meaning, and moreover work in a way that approximates a compelling account of human cognition in which meaning arises from conceptual role. B… ▽ More

    Submitted 12 August, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

  11. arXiv:2207.07051  [pdf, other

    cs.CL cs.AI cs.LG

    Language models show human-like content effects on reasoning tasks

    Authors: Ishita Dasgupta, Andrew K. Lampinen, Stephanie C. Y. Chan, Hannah R. Sheahan, Antonia Creswell, Dharshan Kumaran, James L. McClelland, Felix Hill

    Abstract: Abstract reasoning is a key ability for an intelligent system. Large language models (LMs) achieve above-chance performance on abstract reasoning tasks, but exhibit many imperfections. However, human abstract reasoning is also imperfect. For example, human reasoning is affected by our real-world knowledge and beliefs, and shows notable "content effects"; humans reason more reliably when the semant… ▽ More

    Submitted 30 October, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

  12. arXiv:2206.08349  [pdf, other

    cs.LG cs.AI cs.CL

    Know your audience: specializing grounded language models with listener subtraction

    Authors: Aaditya K. Singh, David Ding, Andrew Saxe, Felix Hill, Andrew K. Lampinen

    Abstract: Effective communication requires adapting to the idiosyncrasies of each communicative context--such as the common ground shared with each partner. Humans demonstrate this ability to specialize to their audience in many contexts, such as the popular game Dixit. We take inspiration from Dixit to formulate a multi-agent image reference game where a (trained) speaker model is rewarded for describing a… ▽ More

    Submitted 1 May, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: 28 pages, 9 figures

  13. arXiv:2205.05055  [pdf, other

    cs.LG cs.AI cs.CL

    Data Distributional Properties Drive Emergent In-Context Learning in Transformers

    Authors: Stephanie C. Y. Chan, Adam Santoro, Andrew K. Lampinen, Jane X. Wang, Aaditya Singh, Pierre H. Richemond, Jay McClelland, Felix Hill

    Abstract: Large transformer-based models are able to perform in-context few-shot learning, without being explicitly trained for it. This observation raises the question: what aspects of the training regime lead to this emergent behavior? Here, we show that this behavior is driven by the distributions of the training data itself. In-context learning emerges when the training data exhibits particular distribu… ▽ More

    Submitted 17 November, 2022; v1 submitted 22 April, 2022; originally announced May 2022.

    Comments: Accepted at NeurIPS 2022 (Oral). Code is available at: https://github.com/deepmind/emergent_in_context_learning

  14. arXiv:2204.05080  [pdf, other

    cs.LG cs.AI

    Semantic Exploration from Language Abstractions and Pretrained Representations

    Authors: Allison C. Tam, Neil C. Rabinowitz, Andrew K. Lampinen, Nicholas A. Roy, Stephanie C. Y. Chan, DJ Strouse, Jane X. Wang, Andrea Banino, Felix Hill

    Abstract: Effective exploration is a challenge in reinforcement learning (RL). Novelty-based exploration methods can suffer in high-dimensional state spaces, such as continuous partially-observable 3D environments. We address this challenge by defining novelty using semantically meaningful state abstractions, which can be found in learned representations shaped by natural language. In particular, we evaluat… ▽ More

    Submitted 26 April, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2022

  15. arXiv:2204.02329  [pdf, other

    cs.CL cs.AI cs.LG

    Can language models learn from explanations in context?

    Authors: Andrew K. Lampinen, Ishita Dasgupta, Stephanie C. Y. Chan, Kory Matthewson, Michael Henry Tessler, Antonia Creswell, James L. McClelland, Jane X. Wang, Felix Hill

    Abstract: Language Models (LMs) can perform new tasks by adapting to a few in-context examples. For humans, explanations that connect examples to task principles can improve learning. We therefore investigate whether explanations of few-shot examples can help LMs. We annotate questions from 40 challenging tasks with answer explanations, and various matched control explanations. We evaluate how different typ… ▽ More

    Submitted 10 October, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Findings of EMNLP 2022

  16. arXiv:2203.08222  [pdf, other

    cs.LG

    Zipfian environments for Reinforcement Learning

    Authors: Stephanie C. Y. Chan, Andrew K. Lampinen, Pierre H. Richemond, Felix Hill

    Abstract: As humans and animals learn in the natural world, they encounter distributions of entities, situations and events that are far from uniform. Typically, a relatively small set of experiences are encountered frequently, while many important experiences occur only rarely. The highly-skewed, heavy-tailed nature of reality poses particular learning challenges that humans and animals have met by evolvin… ▽ More

    Submitted 8 August, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  17. arXiv:2112.08369  [pdf, other

    cs.LG cs.AI

    Feature-Attending Recurrent Modules for Generalization in Reinforcement Learning

    Authors: Wilka Carvalho, Andrew Lampinen, Kyriacos Nikiforou, Felix Hill, Murray Shanahan

    Abstract: Many important tasks are defined in terms of object. To generalize across these tasks, a reinforcement learning (RL) agent needs to exploit the structure that the objects induce. Prior work has either hard-coded object-centric features, used complex object-centric generative models, or updated state using local spatial features. However, these approaches have had limited success in enabling genera… ▽ More

    Submitted 3 November, 2023; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: TMLR, 2023

  18. arXiv:2112.03763  [pdf, other

    cs.LG

    Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning

    Authors: DeepMind Interactive Agents Team, Josh Abramson, Arun Ahuja, Arthur Brussee, Federico Carnevale, Mary Cassin, Felix Fischer, Petko Georgiev, Alex Goldin, Mansi Gupta, Tim Harley, Felix Hill, Peter C Humphreys, Alden Hung, Jessica Landon, Timothy Lillicrap, Hamza Merzic, Alistair Muldal, Adam Santoro, Guy Scully, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

    Abstract: A common vision from science fiction is that robots will one day inhabit our physical spaces, sense the world as we do, assist our physical labours, and communicate with us through natural language. Here we study how to design artificial agents that can interact naturally with humans using the simplification of a virtual environment. We show that imitation learning of human-human interactions in a… ▽ More

    Submitted 2 February, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

  19. arXiv:2112.03753  [pdf, other

    cs.LG cs.AI stat.ML

    Tell me why! Explanations support learning relational and causal structure

    Authors: Andrew K. Lampinen, Nicholas A. Roy, Ishita Dasgupta, Stephanie C. Y. Chan, Allison C. Tam, James L. McClelland, Chen Yan, Adam Santoro, Neil C. Rabinowitz, Jane X. Wang, Felix Hill

    Abstract: Inferring the abstract relational and causal structure of the world is a major challenge for reinforcement-learning (RL) agents. For humans, language--particularly in the form of explanations--plays a considerable role in overcoming this challenge. Here, we show that language can play a similar role for deep RL agents in complex environments. While agents typically struggle to acquire relational a… ▽ More

    Submitted 25 May, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: ICML 2022; 23 pages

    ACM Class: I.2.6

  20. arXiv:2110.09147  [pdf, other

    cs.CL cs.AI cs.LG

    BEAMetrics: A Benchmark for Language Generation Evaluation Evaluation

    Authors: Thomas Scialom, Felix Hill

    Abstract: Natural language processing (NLP) systems are increasingly trained to generate open-ended text rather than classifying between responses. This makes research on evaluation metrics for generated language -- functions that score system output given the context and/or human reference responses -- of critical importance. However, different metrics have different strengths and biases, and reflect human… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  21. arXiv:2106.13884  [pdf, other

    cs.CV cs.CL cs.LG

    Multimodal Few-Shot Learning with Frozen Language Models

    Authors: Maria Tsimpoukelli, Jacob Menick, Serkan Cabi, S. M. Ali Eslami, Oriol Vinyals, Felix Hill

    Abstract: When trained at sufficient scale, auto-regressive language models exhibit the notable ability to learn a new language task after being prompted with just a few examples. Here, we present a simple, yet effective, approach for transferring this few-shot learning ability to a multimodal setting (vision and language). Using aligned image and caption data, we train a vision encoder to represent each im… ▽ More

    Submitted 3 July, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

  22. arXiv:2105.14039  [pdf, other

    cs.LG cs.AI cs.NE

    Towards mental time travel: a hierarchical memory for reinforcement learning agents

    Authors: Andrew Kyle Lampinen, Stephanie C. Y. Chan, Andrea Banino, Felix Hill

    Abstract: Reinforcement learning agents often forget details of the past, especially after delays or distractor tasks. Agents with common memory architectures struggle to recall and integrate across multiple timesteps of a past event, or even to recall the details of a single timestep that is followed by distractor tasks. To address these limitations, we propose a Hierarchical Chunk Attention Memory (HCAM),… ▽ More

    Submitted 8 December, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: NeurIPS 2021; 10 pages main text; 29 pages total

    ACM Class: I.2.6

    Journal ref: Advances in Neural Information Processing Systems, 2021

  23. arXiv:2012.08508  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Attention over learned object embeddings enables complex visual reasoning

    Authors: David Ding, Felix Hill, Adam Santoro, Malcolm Reynolds, Matt Botvinick

    Abstract: Neural networks have achieved success in a wide array of perceptual tasks but often fail at tasks involving both perception and higher-level reasoning. On these more challenging tasks, bespoke approaches (such as modular symbolic components, independent dynamics models or semantic parsers) targeted towards that specific type of task have typically performed better. The downside to these targeted a… ▽ More

    Submitted 26 October, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: 22 pages, 5 figures

  24. arXiv:2012.05672  [pdf, other

    cs.LG cs.AI cs.MA

    Imitating Interactive Intelligence

    Authors: Josh Abramson, Arun Ahuja, Iain Barr, Arthur Brussee, Federico Carnevale, Mary Cassin, Rachita Chhaparia, Stephen Clark, Bogdan Damoc, Andrew Dudzik, Petko Georgiev, Aurelia Guy, Tim Harley, Felix Hill, Alden Hung, Zachary Kenton, Jessica Landon, Timothy Lillicrap, Kory Mathewson, Soňa Mokrá, Alistair Muldal, Adam Santoro, Nikolay Savinov, Vikrant Varma, Greg Wayne , et al. (4 additional authors not shown)

    Abstract: A common vision from science fiction is that robots will one day inhabit our physical spaces, sense the world as we do, assist our physical labours, and communicate with us through natural language. Here we study how to design artificial agents that can interact naturally with humans using the simplification of a virtual environment. This setting nevertheless integrates a number of the central cha… ▽ More

    Submitted 20 January, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

  25. arXiv:2009.01719  [pdf, other

    cs.CL cs.AI

    Grounded Language Learning Fast and Slow

    Authors: Felix Hill, Olivier Tieleman, Tamara von Glehn, Nathaniel Wong, Hamza Merzic, Stephen Clark

    Abstract: Recent work has shown that large text-based neural language models, trained with conventional supervised learning objectives, acquire a surprising propensity for few- and one-shot learning. Here, we show that an embodied agent situated in a simulated 3D world, and endowed with a novel dual-coding external memory, can exhibit similar one-shot word learning when trained with conventional reinforceme… ▽ More

    Submitted 14 October, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

  26. arXiv:2006.01016  [pdf, other

    cs.AI cs.CL cs.LG

    Probing Emergent Semantics in Predictive Agents via Question Answering

    Authors: Abhishek Das, Federico Carnevale, Hamza Merzic, Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Gregory Wayne, Felix Hill

    Abstract: Recent work has shown how predictive modeling can endow agents with rich knowledge of their surroundings, improving their ability to act in complex environments. We propose question-answering as a general paradigm to decode and understand the representations that such agents develop, applying our method to two recent approaches to predictive modeling -action-conditional CPC (Guo et al., 2018) and… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: ICML 2020

  27. arXiv:2005.09382  [pdf, other

    cs.CL

    Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text

    Authors: Felix Hill, Sona Mokra, Nathaniel Wong, Tim Harley

    Abstract: Recent work has described neural-network-based agents that are trained with reinforcement learning (RL) to execute language-like commands in simulated worlds, as a step towards an intelligent agent or robot that can be instructed by human users. However, the optimisation of multi-goal motor policies via deep RL from scratch requires many episodes of experience. Consequently, instruction-following… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

  28. arXiv:1912.05877  [pdf, other

    cs.CL cs.AI

    Extending Machine Language Models toward Human-Level Language Understanding

    Authors: James L. McClelland, Felix Hill, Maja Rudolph, Jason Baldridge, Hinrich Schütze

    Abstract: Language is crucial for human intelligence, but what exactly is its role? We take language to be a part of a system for understanding and communicating about situations. The human ability to understand and communicate about situations emerges gradually from experience and depends on domain-general principles of biological neural networks: connection-based learning, distributed representation, and… ▽ More

    Submitted 4 July, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

  29. arXiv:1910.00571  [pdf, other

    cs.AI

    Environmental drivers of systematicity and generalization in a situated agent

    Authors: Felix Hill, Andrew Lampinen, Rosalia Schneider, Stephen Clark, Matthew Botvinick, James L. McClelland, Adam Santoro

    Abstract: The question of whether deep neural networks are good at generalising beyond their immediate training experience is of critical importance for learning-based approaches to AI. Here, we consider tests of out-of-sample generalisation that require an agent to respond to never-seen-before instructions by manipulating and positioning objects in a 3D Unity simulated room. We first describe a comparative… ▽ More

    Submitted 19 February, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

  30. arXiv:1909.00303  [pdf, other

    cs.CL

    Higher-order Comparisons of Sentence Encoder Representations

    Authors: Mostafa Abdou, Artur Kulmizev, Felix Hill, Daniel M. Low, Anders Søgaard

    Abstract: Representational Similarity Analysis (RSA) is a technique developed by neuroscientists for comparing activity patterns of different measurement modalities (e.g., fMRI, electrophysiology, behavior). As a framework, RSA has several advantages over existing approaches to interpretation of language encoders based on probing or diagnostic classification: namely, it does not require large training sampl… ▽ More

    Submitted 5 September, 2019; v1 submitted 31 August, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  31. arXiv:1905.00537  [pdf, other

    cs.CL cs.AI

    SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems

    Authors: Alex Wang, Yada Pruksachatkun, Nikita Nangia, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman

    Abstract: In the last year, new models and methods for pretraining and transfer learning have driven striking performance improvements across a range of language understanding tasks. The GLUE benchmark, introduced a little over one year ago, offers a single-number metric that summarizes progress on a diverse set of such tasks, but performance on the benchmark has recently surpassed the level of non-expert h… ▽ More

    Submitted 12 February, 2020; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019, super.gluebenchmark.com updating acknowledegments

  32. arXiv:1904.10396  [pdf, other

    q-bio.NC cs.AI cs.LG

    Is coding a relevant metaphor for building AI? A commentary on "Is coding a relevant metaphor for the brain?", by Romain Brette

    Authors: Adam Santoro, Felix Hill, David Barrett, David Raposo, Matthew Botvinick, Timothy Lillicrap

    Abstract: Brette contends that the neural coding metaphor is an invalid basis for theories of what the brain does. Here, we argue that it is an insufficient guide for building an artificial intelligence that learns to accomplish short- and long-term goals in a complex, changing environment.

    Submitted 18 April, 2019; originally announced April 2019.

  33. arXiv:1904.01557  [pdf, other

    cs.LG stat.ML

    Analysing Mathematical Reasoning Abilities of Neural Models

    Authors: David Saxton, Edward Grefenstette, Felix Hill, Pushmeet Kohli

    Abstract: Mathematical reasoning---a core ability within human intelligence---presents some unique challenges as a domain: we do not come to understand and solve mathematical problems primarily on the back of experience and evidence, but on the basis of inferring, learning, and exploiting laws, axioms, and symbol manipulation rules. In this paper, we present a new challenge for the evaluation (and eventuall… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

  34. arXiv:1902.00120  [pdf, other

    cs.AI

    Learning to Make Analogies by Contrasting Abstract Relational Structure

    Authors: Felix Hill, Adam Santoro, David G. T. Barrett, Ari S. Morcos, Timothy Lillicrap

    Abstract: Analogical reasoning has been a principal focus of various waves of AI research. Analogy is particularly challenging for machines because it requires relational structures to be represented such that they can be flexibly applied across diverse domains of experience. Here, we study how analogical reasoning can be induced in neural networks that learn to perceive and reason about raw visual data. We… ▽ More

    Submitted 31 January, 2019; originally announced February 2019.

  35. arXiv:1812.00898  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning

    Authors: Aishwarya Agrawal, Mateusz Malinowski, Felix Hill, Ali Eslami, Oriol Vinyals, Tejas Kulkarni

    Abstract: Advances in Deep Reinforcement Learning have led to agents that perform well across a variety of sensory-motor domains. In this work, we study the setting in which an agent must learn to generate programs for diverse scenes conditioned on a given symbolic instruction. Final goals are specified to our agent via images of the scenes. A symbolic instruction consistent with the goal images is used as… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

  36. arXiv:1808.00508  [pdf, other

    cs.NE

    Neural Arithmetic Logic Units

    Authors: Andrew Trask, Felix Hill, Scott Reed, Jack Rae, Chris Dyer, Phil Blunsom

    Abstract: Neural networks can learn to represent and manipulate numerical information, but they seldom generalize well outside of the range of numerical values encountered during training. To encourage more systematic numerical extrapolation, we propose an architecture that represents numerical quantities as linear activations which are manipulated using primitive arithmetic operators, controlled by learned… ▽ More

    Submitted 1 August, 2018; originally announced August 2018.

  37. arXiv:1807.04225  [pdf, other

    cs.LG stat.ML

    Measuring abstract reasoning in neural networks

    Authors: David G. T. Barrett, Felix Hill, Adam Santoro, Ari S. Morcos, Timothy Lillicrap

    Abstract: Whether neural networks can learn abstract reasoning or whether they merely rely on superficial statistics is a topic of recent debate. Here, we propose a dataset and challenge designed to probe abstract reasoning, inspired by a well-known human IQ test. To succeed at this challenge, models must cope with various generalisation `regimes' in which the training and test data differ in clearly-define… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: ICML 2018

  38. arXiv:1806.01946  [pdf, other

    cs.AI cs.LG

    Learning to Understand Goal Specifications by Modelling Reward

    Authors: Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Arian Hosseini, Pushmeet Kohli, Edward Grefenstette

    Abstract: Recent work has shown that deep reinforcement-learning agents can learn to follow language-like instructions from infrequent environment rewards. However, this places on environment designers the onus of designing language-conditional reward functions which may not be easily or tractably implemented as the complexity of the environment and the language scales. To overcome this limitation, we prese… ▽ More

    Submitted 23 December, 2019; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: 19 pages, 9 figures

  39. arXiv:1804.07461  [pdf, other

    cs.CL

    GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

    Authors: Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman

    Abstract: For natural language understanding (NLU) technology to be maximally useful, both practically and as a scientific object of study, it must be general: it must be able to process language in a way that is not exclusively tailored to any one specific task or dataset. In pursuit of this objective, we introduce the General Language Understanding Evaluation benchmark (GLUE), a tool for evaluating and an… ▽ More

    Submitted 22 February, 2019; v1 submitted 20 April, 2018; originally announced April 2018.

    Comments: ICLR 2019; https://gluebenchmark.com/

  40. arXiv:1710.09867  [pdf, other

    cs.CL cs.AI cs.NE

    Understanding Early Word Learning in Situated Artificial Agents

    Authors: Felix Hill, Stephen Clark, Karl Moritz Hermann, Phil Blunsom

    Abstract: Neural network-based systems can now learn to locate the referents of words and phrases in images, answer questions about visual scenes, and execute symbolic instructions as first-person actors in partially-observable worlds. To achieve this so-called grounded language learning, models must overcome challenges that infants face when learning their first words. While it is notable that models with… ▽ More

    Submitted 1 October, 2019; v1 submitted 26 October, 2017; originally announced October 2017.

  41. arXiv:1706.06551  [pdf, other

    cs.CL cs.LG stat.ML

    Grounded Language Learning in a Simulated 3D World

    Authors: Karl Moritz Hermann, Felix Hill, Simon Green, Fumin Wang, Ryan Faulkner, Hubert Soyer, David Szepesvari, Wojciech Marian Czarnecki, Max Jaderberg, Denis Teplyashin, Marcus Wainwright, Chris Apps, Demis Hassabis, Phil Blunsom

    Abstract: We are increasingly surrounded by artificially intelligent technology that takes decisions and executes actions on our behalf. This creates a pressing need for general means to communicate with, instruct and guide artificial agents, with human language the most compelling means for such communication. To achieve this in a scalable fashion, agents must be able to relate language to the world and to… ▽ More

    Submitted 26 June, 2017; v1 submitted 20 June, 2017; originally announced June 2017.

    Comments: 16 pages, 8 figures

  42. arXiv:1608.02117  [pdf, other

    cs.CL

    HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment

    Authors: Ivan Vulić, Daniela Gerz, Douwe Kiela, Felix Hill, Anna Korhonen

    Abstract: We introduce HyperLex - a dataset and evaluation resource that quantifies the extent of of the semantic category membership, that is, type-of relation also known as hyponymy-hypernymy or lexical entailment (LE) relation between 2,616 concept pairs. Cognitive psychology research has established that typicality and category/class membership are computed in human semantic memory as a gradual rather t… ▽ More

    Submitted 10 May, 2017; v1 submitted 6 August, 2016; originally announced August 2016.

  43. arXiv:1608.00869  [pdf, other

    cs.CL

    SimVerb-3500: A Large-Scale Evaluation Set of Verb Similarity

    Authors: Daniela Gerz, Ivan Vulić, Felix Hill, Roi Reichart, Anna Korhonen

    Abstract: Verbs play a critical role in the meaning of sentences, but these ubiquitous words have received little attention in recent distributional semantics research. We introduce SimVerb-3500, an evaluation resource that provides human ratings for the similarity of 3,500 verb pairs. SimVerb-3500 covers all normed verb types from the USF free-association database, providing at least three examples for eve… ▽ More

    Submitted 20 September, 2016; v1 submitted 2 August, 2016; originally announced August 2016.

    Comments: EMNLP 2016

  44. arXiv:1602.03483  [pdf, ps, other

    cs.CL cs.LG

    Learning Distributed Representations of Sentences from Unlabelled Data

    Authors: Felix Hill, Kyunghyun Cho, Anna Korhonen

    Abstract: Unsupervised methods for learning distributed representations of words are ubiquitous in today's NLP research, but far less is known about the best ways to learn distributed phrase or sentence representations from unlabelled data. This paper is a systematic comparison of models that learn such representations. We find that the optimal approach depends critically on the intended application. Deeper… ▽ More

    Submitted 10 February, 2016; originally announced February 2016.

  45. arXiv:1511.02301  [pdf, other

    cs.CL

    The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations

    Authors: Felix Hill, Antoine Bordes, Sumit Chopra, Jason Weston

    Abstract: We introduce a new test of how well language models capture meaning in children's books. Unlike standard language modelling benchmarks, it distinguishes the task of predicting syntactic function words from that of predicting lower-frequency words, which carry greater semantic content. We compare a range of state-of-the-art models, each with a different way of encoding what has been previously read… ▽ More

    Submitted 1 April, 2016; v1 submitted 6 November, 2015; originally announced November 2015.

  46. arXiv:1504.00548  [pdf, ps, other

    cs.CL

    Learning to Understand Phrases by Embedding the Dictionary

    Authors: Felix Hill, Kyunghyun Cho, Anna Korhonen, Yoshua Bengio

    Abstract: Distributional models that learn rich semantic word representations are a success story of recent NLP research. However, developing models that learn useful representations of phrases and sentences has proved far harder. We propose using the definitions found in everyday dictionaries as a means of bridging this gap between lexical and phrasal semantics. Neural language embedding models can be effe… ▽ More

    Submitted 22 March, 2016; v1 submitted 2 April, 2015; originally announced April 2015.

  47. arXiv:1412.6448  [pdf, other

    cs.CL

    Embedding Word Similarity with Neural Machine Translation

    Authors: Felix Hill, Kyunghyun Cho, Sebastien Jean, Coline Devin, Yoshua Bengio

    Abstract: Neural language models learn word representations, or embeddings, that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models, a recently-developed class of neural language model. We show that embeddings from translation models outperform those learned by monolingual models at tasks that require knowledge of both conceptu… ▽ More

    Submitted 3 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: arXiv admin note: text overlap with arXiv:1410.0718

  48. arXiv:1410.0718  [pdf, other

    cs.CL

    Not All Neural Embeddings are Born Equal

    Authors: Felix Hill, KyungHyun Cho, Sebastien Jean, Coline Devin, Yoshua Bengio

    Abstract: Neural language models learn word representations that capture rich linguistic and conceptual information. Here we investigate the embeddings learned by neural machine translation models. We show that translation-based embeddings outperform those learned by cutting-edge monolingual models at single-language tasks requiring knowledge of conceptual similarity and/or syntactic role. The findings sugg… ▽ More

    Submitted 13 November, 2014; v1 submitted 2 October, 2014; originally announced October 2014.

    Comments: 4 pages plus 1 page of references

  49. arXiv:1408.3456  [pdf, other

    cs.CL

    SimLex-999: Evaluating Semantic Models with (Genuine) Similarity Estimation

    Authors: Felix Hill, Roi Reichart, Anna Korhonen

    Abstract: We present SimLex-999, a gold standard resource for evaluating distributional semantic models that improves on existing resources in several important ways. First, in contrast to gold standards such as WordSim-353 and MEN, it explicitly quantifies similarity rather than association or relatedness, so that pairs of entities that are associated but not actually similar [Freud, psychology] have a low… ▽ More

    Submitted 14 August, 2014; originally announced August 2014.