Skip to main content

Showing 1–27 of 27 results for author: Suarez, J

  1. arXiv:2406.12905  [pdf, other

    cs.LG cs.AI cs.MA

    PufferLib: Making Reinforcement Learning Libraries and Environments Play Nice

    Authors: Joseph Suarez

    Abstract: You have an environment, a model, and a reinforcement learning library that are designed to work together but don't. PufferLib makes them play nice. The library provides one-line environment wrappers that eliminate common compatibility problems and fast vectorization to accelerate training. With PufferLib, you can use familiar libraries like CleanRL and SB3 to scale from classic benchmarks like At… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2406.05071  [pdf, other

    cs.AI cs.LG cs.MA

    Massively Multiagent Minigames for Training Generalist Agents

    Authors: Kyoung Whan Choe, Ryan Sullivan, Joseph Suárez

    Abstract: We present Meta MMO, a collection of many-agent minigames for use as a reinforcement learning benchmark. Meta MMO is built on top of Neural MMO, a massively multiagent environment that has been the subject of two previous NeurIPS competitions. Our work expands Neural MMO with several computationally efficient minigames. We explore generalization across Meta MMO by learning to play several minigame… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2311.11795  [pdf, ps, other

    cs.PL

    Effects and Coeffects in Call-By-Push-Value (Extended Version)

    Authors: Cassia Torczon, Emmanuel Suárez Acevedo, Shubh Agrawal, Joey Velez-Ginorio, Stephanie Weirich

    Abstract: Effect and coeffect tracking are a useful way to integrate many types of compile-time analysis, such as cost, liveness or dataflow, into a language's type system. However, their interactions with call-by-push-value (CBPV), a computational model useful in compilation for its isolation of effects and its ability to encompass both call-by-name and call-by-value computations, are still poorly understo… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  4. arXiv:2311.03736  [pdf, other

    cs.AI cs.LG cs.MA

    Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning

    Authors: Joseph Suárez, Phillip Isola, Kyoung Whan Choe, David Bloomin, Hao Xiang Li, Nikhil Pinnaparaju, Nishaanth Kanna, Daniel Scott, Ryan Sullivan, Rose S. Shuman, Lucas de Alcântara, Herbie Bradley, Louis Castricato, Kirsty You, Yuhao Jiang, Qimai Li, Jiaxin Chen, Xiaolong Zhu

    Abstract: Neural MMO 2.0 is a massively multi-agent environment for reinforcement learning research. The key feature of this new version is a flexible task system that allows users to define a broad range of objectives and reward signals. We challenge researchers to train agents capable of generalizing to tasks, maps, and opponents never seen during training. Neural MMO features procedurally generated maps… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  5. arXiv:2311.03707  [pdf, other

    cs.AI cs.LG cs.MA

    The NeurIPS 2022 Neural MMO Challenge: A Massively Multiagent Competition with Specialization and Trade

    Authors: Enhong Liu, Joseph Suarez, Chenhui You, Bo Wu, Bingcheng Chen, Jun Hu, Jiaxin Chen, Xiaolong Zhu, Clare Zhu, Julian Togelius, Sharada Mohanty, Weijun Hong, Rui Du, Yibing Zhang, Qinwen Wang, Xinhang Li, Zheng Yuan, Xiang Li, Yuejia Huang, Kun Zhang, Hanhui Yang, Shiqi Tang, Phillip Isola

    Abstract: In this paper, we present the results of the NeurIPS-2022 Neural MMO Challenge, which attracted 500 participants and received over 1,600 submissions. Like the previous IJCAI-2022 Neural MMO Challenge, it involved agents from 16 populations surviving in procedurally generated worlds by collecting resources and defeating opponents. This year's competition runs on the latest v1.6 Neural MMO, which in… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  6. arXiv:2310.17805  [pdf, other

    cs.LG cs.AI

    Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks

    Authors: Ryan Sullivan, Akarsh Kumar, Shengyi Huang, John P. Dickerson, Joseph Suarez

    Abstract: Most reinforcement learning methods rely heavily on dense, well-normalized environment rewards. DreamerV3 recently introduced a model-based method with a number of tricks that mitigate these limitations, achieving state-of-the-art on a wide range of benchmarks with a single set of hyperparameters. This result sparked discussion about the generality of the tricks, since they appear to be applicable… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  7. arXiv:2309.15724  [pdf, ps, other

    cs.PL

    Making Logical Relations More Relatable (Proof Pearl)

    Authors: Emmanuel Suárez Acevedo, Stephanie Weirich

    Abstract: Mechanical proofs by logical relations often involve tedious reasoning about substitution. In this paper, we show that this is not necessarily the case, by developing, in Agda, a proof that all simply typed lambda calculus expressions evaluate to values. A formalization of the proof is remarkably short (~40 lines of code), making for an excellent introduction to the technique of proofs by logical… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Submitted for publication

  8. arXiv:2308.15802  [pdf, other

    cs.AI

    Benchmarking Robustness and Generalization in Multi-Agent Systems: A Case Study on Neural MMO

    Authors: Yangkun Chen, Joseph Suarez, Junjie Zhang, Chenghui Yu, Bo Wu, Hanmo Chen, Hengman Zhu, Rui Du, Shanliang Qian, Shuai Liu, Weijun Hong, Jinke He, Yibing Zhang, Liang Zhao, Clare Zhu, Julian Togelius, Sharada Mohanty, Jiaxin Chen, Xiu Li, Xiaolong Zhu, Phillip Isola

    Abstract: We present the results of the second Neural MMO challenge, hosted at IJCAI 2022, which received 1600+ submissions. This competition targets robustness and generalization in multi-agent systems: participants train teams of agents to complete a multi-task objective against opponents not seen during training. The competition combines relatively complex environment design with large numbers of agents… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  9. arXiv:2307.07631  [pdf, other

    cs.LG

    Towards Model-Size Agnostic, Compute-Free, Memorization-based Inference of Deep Learning

    Authors: Davide Giacomini, Maeesha Binte Hashem, Jeremiah Suarez, Swarup Bhunia, Amit Ranjan Trivedi

    Abstract: The rapid advancement of deep neural networks has significantly improved various tasks, such as image and speech recognition. However, as the complexity of these models increases, so does the computational cost and the number of parameters, making it difficult to deploy them on resource-constrained devices. This paper proposes a novel memorization-based inference (MBI) that is compute free and onl… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  10. arXiv:2302.14060  [pdf, other

    cs.LG cs.AI

    Semi-supervised Clustering with Two Types of Background Knowledge: Fusing Pairwise Constraints and Monotonicity Constraints

    Authors: Germán González-Almagro, Juan Luis Suárez, Pablo Sánchez-Bermejo, José-Ramón Cano, Salvador García

    Abstract: This study addresses the problem of performing clustering in the presence of two types of background knowledge: pairwise constraints and monotonicity constraints. To achieve this, the formal framework to perform clustering under monotonicity constraints is, firstly, defined, resulting in a specific distance measure. Pairwise constraints are integrated afterwards by designing an objective function… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

  11. arXiv:2204.10491  [pdf, other

    cs.DS cs.DM

    Some Optimization Solutions for Relief Distribution

    Authors: Jhoirene Clemente, Jessie James Suarez, Olivia Demetria, Perry Go, Dylan Salcedo

    Abstract: Humanitarian logistics remain a challenging area of application for operations research. In relief distribution, the main goal is to deliver all the supplies to those that are in need in the fastest way possible. In this paper, we present different optimization solutions for relief distribution. We present a formalization of the three main problems in the humanitarian logistics aspect of relief di… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    ACM Class: J.7

    Journal ref: In Proceedings of the Philippine Computing Science Congress 2021

  12. arXiv:2202.06682  [pdf

    cs.HC

    Client Monitoring Software: A Monitoring Tool for Greatleaf Land Inc

    Authors: Carlo H. Godoy Jr., Jerico C. Torayno, Audrey Rose Abbey C. Magtarayo, Mark Wilson J. Suarez, Armando Embile, Daven Christian O. Estopia

    Abstract: Monitoring typically supports greater analysis and allows for a lot deeper data collection on a Web browser level. Analysts may usually see the use of web-based monitoring software within an entire client context when it comes to client-side monitoring, on the other hand. In the case of Greatleaf Land Inc., their monitoring is somehow traditional. Traditional means that the paper method of monitor… ▽ More

    Submitted 19 January, 2022; originally announced February 2022.

    Comments: 8 pages, 8 Figures, 4 Tables

    MSC Class: C.3 ACM Class: C.3

  13. arXiv:2110.07594  [pdf, other

    cs.LG cs.AI cs.MA

    The Neural MMO Platform for Massively Multiagent Research

    Authors: Joseph Suarez, Yilun Du, Clare Zhu, Igor Mordatch, Phillip Isola

    Abstract: Neural MMO is a computationally accessible research platform that combines large agent populations, long time horizons, open-ended tasks, and modular game systems. Existing environments feature subsets of these properties, but Neural MMO is the first to combine them all. We present Neural MMO as free and open source software with active support, ongoing development, documentation, and additional t… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

  14. arXiv:2009.14146  [pdf, other

    cs.CV cs.LG eess.IV

    A Survey on Deep Learning Techniques for Video Anomaly Detection

    Authors: Jessie James P. Suarez, Prospero C. Naval Jr

    Abstract: Anomaly detection in videos is a problem that has been studied for more than a decade. This area has piqued the interest of researchers due to its wide applicability. Because of this, there has been a wide array of approaches that have been proposed throughout the years and these approaches range from statistical-based approaches to machine learning-based approaches. Numerous surveys have already… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

  15. A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages

    Authors: Pedro Javier Ortiz Suárez, Laurent Romary, Benoît Sagot

    Abstract: We use the multilingual OSCAR corpus, extracted from Common Crawl via language classification, filtering and cleaning, to train monolingual contextualized word embeddings (ELMo) for five mid-resource languages. We then compare the performance of OSCAR-based and Wikipedia-based ELMo embeddings for these languages on the part-of-speech tagging and parsing tasks. We show that, despite the noise in th… ▽ More

    Submitted 18 June, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Journal ref: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, July 2020, Online

  16. arXiv:2006.01409  [pdf, other

    eess.IV cs.CV

    COVIDGR dataset and COVID-SDNet methodology for predicting COVID-19 based on Chest X-Ray images

    Authors: S. Tabik, A. Gómez-Ríos, J. L. Martín-Rodríguez, I. Sevillano-García, M. Rey-Area, D. Charte, E. Guirado, J. L. Suárez, J. Luengo, M. A. Valero-González, P. García-Villanova, E. Olmedo-Sánchez, F. Herrera

    Abstract: Currently, Coronavirus disease (COVID-19), one of the most infectious diseases in the 21st century, is diagnosed using RT-PCR testing, CT scans and/or Chest X-Ray (CXR) images. CT (Computed Tomography) scanners and RT-PCR testing are not available in most medical centers and hence in many cases CXR images become the most time/cost effective tool for assisting clinicians in making decisions. Deep l… ▽ More

    Submitted 11 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: Paper accepted in Journal of Biomedical And Health Informatics

  17. arXiv:2005.13236  [pdf, ps, other

    cs.CL

    Establishing a New State-of-the-Art for French Named Entity Recognition

    Authors: Pedro Javier Ortiz Suárez, Yoann Dupont, Benjamin Muller, Laurent Romary, Benoît Sagot

    Abstract: The French TreeBank developed at the University Paris 7 is the main source of morphosyntactic and syntactic annotations for French. However, it does not include explicit information related to named entities, which are among the most useful information for several natural language processing tasks and applications. Moreover, no large-scale French corpus with named entity annotations contain refere… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

    Journal ref: LREC 2020 - 12th Language Resources and Evaluation Conference, May 2020, Marseille, France

  18. A Physiology-Driven Computational Model for Post-Cardiac Arrest Outcome Prediction

    Authors: Han B. Kim, Hieu Nguyen, Qingchu Jin, Sharmila Tamby, Tatiana Gelaf Romer, Eric Sung, Ran Liu, Joseph Greenstein, Jose I. Suarez, Christian Storm, Raimond Winslow, Robert D. Stevens

    Abstract: Patients resuscitated from cardiac arrest (CA) face a high risk of neurological disability and death, however pragmatic methods are lacking for accurate and reliable prognostication. The aim of this study was to build computational models to predict post-CA outcome by leveraging high-dimensional patient data available early after admission to the intensive care unit (ICU). We hypothesized that mod… ▽ More

    Submitted 11 February, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: 51 pages, 7 figures, 4 supplementary figures

    ACM Class: J.3; I.2.1; I.6.4; G.3

    Journal ref: Anaesthesia Critical Care & Pain Medicine 41.1 (2022): 101015

  19. arXiv:2001.12004  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks

    Authors: Joseph Suarez, Yilun Du, Igor Mordatch, Phillip Isola

    Abstract: Progress in multiagent intelligence research is fundamentally limited by the number and quality of environments available for study. In recent years, simulated games have become a dominant research platform within reinforcement learning, in part due to their accessibility and interpretability. Previous works have targeted and demonstrated success on arcade, first person shooter (FPS), real-time st… ▽ More

    Submitted 16 April, 2020; v1 submitted 31 January, 2020; originally announced January 2020.

  20. CamemBERT: a Tasty French Language Model

    Authors: Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah, Benoît Sagot

    Abstract: Pretrained language models are now ubiquitous in Natural Language Processing. Despite their success, most available models have either been trained on English data or on the concatenation of data in multiple languages. This makes practical use of such models --in all languages except English-- very limited. In this paper, we investigate the feasibility of training monolingual Transformer-based lan… ▽ More

    Submitted 21 May, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: ACL 2020 long paper. Web site: https://camembert-model.fr

    Journal ref: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, July 2020, Online

  21. arXiv:1904.00724  [pdf, other

    cs.CV cs.LG

    GAN You Do the GAN GAN?

    Authors: Joseph Suarez

    Abstract: Generative Adversarial Networks (GANs) have become a dominant class of generative models. In recent years, GAN variants have yielded especially impressive results in the synthesis of a variety of forms of data. Examples include compelling natural and artistic images, textures, musical sequences, and 3D object files. However, one obvious synthesis candidate is missing. In this work, we answer one o… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

    Comments: 3 pages

  22. arXiv:1903.00784  [pdf, other

    cs.MA cs.LG stat.ML

    Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents

    Authors: Joseph Suarez, Yilun Du, Phillip Isola, Igor Mordatch

    Abstract: The emergence of complex life on Earth is often attributed to the arms race that ensued from a huge number of organisms all competing for finite resources. We present an artificial intelligence research environment, inspired by the human game genre of MMORPGs (Massively Multiplayer Online Role-Playing Games, a.k.a. MMOs), that aims to simulate this setting in microcosm. As with MMORPGs and the rea… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

  23. arXiv:1803.11361  [pdf, other

    cs.CV

    DDRprog: A CLEVR Differentiable Dynamic Reasoning Programmer

    Authors: Joseph Suarez, Justin Johnson, Fei-Fei Li

    Abstract: We present a novel Dynamic Differentiable Reasoning (DDR) framework for jointly learning branching programs and the functions composing them; this resolves a significant nondifferentiability inhibiting recent dynamic architectures. We apply our framework to two settings in two highly compact and data efficient architectures: DDRprog for CLEVR Visual Question Answering and DDRstack for reverse Poli… ▽ More

    Submitted 30 March, 2018; originally announced March 2018.

  24. arXiv:1707.02402  [pdf, other

    cs.CV

    Effective Approaches to Batch Parallelization for Dynamic Neural Network Architectures

    Authors: Joseph Suarez, Clare Zhu

    Abstract: We present a simple dynamic batching approach applicable to a large class of dynamic architectures that consistently yields speedups of over 10x. We provide performance bounds when the architecture is not known a priori and a stronger bound in the special case where the architecture is a predetermined balanced tree. We evaluate our approach on Johnson et al.'s recent visual question answering (VQA… ▽ More

    Submitted 8 July, 2017; originally announced July 2017.

    Comments: Code at https://github.com/jsuarez5341/Efficient-Dynamic-Batching

  25. arXiv:1611.05360  [pdf

    cs.CL

    The Life of Lazarillo de Tormes and of His Machine Learning Adversities

    Authors: Javier de la Rosa, Juan-Luis Suárez

    Abstract: Summit work of the Spanish Golden Age and forefather of the so-called picaresque novel, The Life of Lazarillo de Tormes and of His Fortunes and Adversities still remains an anonymous text. Although distinguished scholars have tried to attribute it to different authors based on a variety of criteria, a consensus has yet to be reached. The list of candidates is long and not all of them enjoy the sam… ▽ More

    Submitted 16 November, 2016; originally announced November 2016.

    Comments: 66 pages, 11 figures

    Journal ref: Lemir: Revista de Literatura Española Medieval y del Renacimiento, 20 (2016)

  26. arXiv:1603.07635  [pdf, other

    cs.NI

    Exploring the Use of RPAs as 5G Points of Presence

    Authors: Javier Suarez, Ivan Vidal, Jaime Garcia-Reinoso, Francisco Valera, Arturo Azcorra

    Abstract: This paper presents an early exploration and preliminary results on the use of Remotely Piloted Aircrafts (RPA) as 5G points of presence. The use of RPAs in the 5G arena would enable a cost-effective deployment of functions over mobile nodes that could be integrated on demand into the programmable and unified 5G infrastructure, enhancing the capacity of the network to flexibly adapt to the particu… ▽ More

    Submitted 24 March, 2016; originally announced March 2016.

    Comments: Under review, EuCNC 2016

  27. arXiv:1409.4814  [pdf

    cs.AI cs.IR

    ICE: Enabling Non-Experts to Build Models Interactively for Large-Scale Lopsided Problems

    Authors: Patrice Simard, David Chickering, Aparna Lakshmiratan, Denis Charles, Leon Bottou, Carlos Garcia Jurado Suarez, David Grangier, Saleema Amershi, Johan Verwey, Jina Suh

    Abstract: Quick interaction between a human teacher and a learning machine presents numerous benefits and challenges when working with web-scale data. The human teacher guides the machine towards accomplishing the task of interest. The learning machine leverages big data to find examples that maximize the training value of its interaction with the teacher. When the teacher is restricted to labeling examples… ▽ More

    Submitted 16 September, 2014; originally announced September 2014.