Showing 1–50 of 83 results for author: Martin, P

  1. arXiv:2406.18571  [pdf, other]

    cs.CV

    UltraCortex: Submillimeter Ultra-High Field 9.4 T1 Brain MR Image Collection and Manual Cortical Segmentations

    Authors: Lucas Mahler, Julius Steiglechner, Benjamin Bender, Tobias Lindig, Dana Ramadan, Jonas Bause, Florian Birk, Rahel Heule, Edyta Charyasz, Michael Erb, Vinod Jangir Kumar, Gisela E Hagberg, Pascal Martin, Gabriele Lohmann, Klaus Scheffler

    Abstract: The UltraCortex repository (https://www.ultracortex.org) houses magnetic resonance imaging data of the human brain obtained at an ultra-high field strength of 9.4 T. It contains 86 structural MR images with spatial resolutions ranging from 0.6 to 0.8 mm. Additionally, the repository includes segmentations of 12 brains into gray and white matter compartments. These segmentations have been independe…

    Submitted 5 July, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2405.20078  [pdf]

    cs.MM

    NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation

    Authors: Pedro Martin, Antonio Rodrigues, Joao Ascenso, Maria Paula Queluz

    Abstract: Neural radiance fields (NeRF) are a groundbreaking computer vision technology that enables the generation of high-quality, immersive visual content from multiple viewpoints. This capability holds significant advantages for applications such as virtual/augmented reality, 3D modelling and content creation for the film and entertainment industry. However, the evaluation of NeRF methods poses several…

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.15338  [pdf, other]

    cs.SD eess.AS

    SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation

    Authors: Xinlei Niu, Jing Zhang, Christian Walder, Charles Patrick Martin

    Abstract: We present SoundLoCD, a novel text-to-sound generation framework, which incorporates a LoRA-based conditional discrete contrastive latent diffusion model. Unlike recent large-scale sound generation models, our model can be efficiently trained under limited computational resources. The integration of a contrastive learning strategy further enhances the connection between text conditions and the gen…

    Submitted 24 May, 2024; originally announced May 2024.

  4. arXiv:2405.04592  [pdf]

    cs.LG

    Integrating knowledge-guided symbolic regression and model-based design of experiments to automate process flow diagram development

    Authors: Alexander W. Rogers, Amanda Lane, Cesar Mendoza, Simon Watson, Adam Kowalski, Philip Martin, Dongda Zhang

    Abstract: New products must be formulated rapidly to succeed in the global formulated product market; however, key product indicators (KPIs) can be complex, poorly understood functions of the chemical composition and processing history. Consequently, scale-up must currently undergo expensive trial-and-error campaigns. To accelerate process flow diagram (PFD) optimisation and knowledge discovery, this work p…

    Submitted 7 May, 2024; originally announced May 2024.

  5. arXiv:2405.01886  [pdf, other]

    cs.CL cs.AI

    Aloe: A Family of Fine-tuned Open Healthcare LLMs

    Authors: Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Jordi Bayarri-Planas, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo Agustin Martin-Torres, Lucia Urcelay-Ganzabal, Marta Gonzalez-Mallo, Sergio Alvarez-Napagao, Eduard Ayguadé-Parra, Ulises Cortés, Dario Garcia-Gasulla

    Abstract: As the capabilities of Large Language Models (LLMs) in healthcare and medicine continue to advance, there is a growing need for competitive open-source models that can safeguard public interest. With the increasing availability of highly competitive open base models, the impact of continued pre-training is increasingly uncertain. In this work, we explore the role of instruct tuning, model merging,…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Five appendix

  6. arXiv:2404.15637  [pdf, other]

    cs.SD cs.MM eess.AS

    HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts

    Authors: Xinlei Niu, Jing Zhang, Charles Patrick Martin

    Abstract: We introduce HybridVC, a voice conversion (VC) framework built upon a pre-trained conditional variational autoencoder (CVAE) that combines the strengths of a latent model with contrastive learning. HybridVC supports text and audio prompts, enabling more flexible voice style conversion. HybridVC models a latent distribution conditioned on speaker embeddings acquired by a pretrained speaker encoder…

    Submitted 24 April, 2024; originally announced April 2024.

  7. arXiv:2404.12356  [pdf, other]

    stat.ML cs.LG cs.SI

    Improving the interpretability of GNN predictions through conformal-based graph sparsification

    Authors: Pablo Sanchez-Martin, Kinaan Aamir Khan, Isabel Valera

    Abstract: Graph Neural Networks (GNNs) have achieved state-of-the-art performance in solving graph classification tasks. However, most GNN architectures aggregate information from all nodes and edges in a graph, regardless of their relevance to the task at hand, thus hindering the interpretability of their predictions. In contrast to prior work, in this paper we propose a GNN training approach that j…

    Submitted 18 April, 2024; originally announced April 2024.

  8. arXiv:2403.17776  [pdf, other]

    cs.SI cs.HC stat.ME

    Exploring the Boundaries of Ambient Awareness in Twitter

    Authors: Pablo Sanchez-Martin, Sonja Utz, Isabel Valera

    Abstract: Ambient awareness refers to the ability of social media users to obtain knowledge about who knows what (i.e., users' expertise) in their network, by simply being exposed to other users' content (e.g., tweets on Twitter). Previous work, based on user surveys, reveals that individuals self-report ambient awareness only for parts of their networks. However, it is unclear whether it is their limited co…

    Submitted 26 March, 2024; originally announced March 2024.

  9. arXiv:2309.08048  [pdf, other]

    cs.CV cs.AI

    Padding Aware Neurons

    Authors: Dario Garcia-Gasulla, Victor Gimenez-Abalos, Pablo Martin-Torres

    Abstract: Convolutional layers are a fundamental component of most image-related models. These layers often implement by default a static padding policy (e.g., zero padding), to control the scale of the internal representations, and to allow kernel activations centered on the border regions. In this work we identify Padding Aware Neurons (PANs), a type of filter that is found in most (if not all) convolutiona…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: In 4th Visual Inductive Priors for Data-Efficient Deep Learning Workshop, ICCV 2023

  10. arXiv:2309.03671  [pdf, other]

    cs.CV cs.AI cs.LG

    Dataset Generation and Bonobo Classification from Weakly Labelled Videos

    Authors: Pierre-Etienne Martin

    Abstract: This paper presents a bonobo detection and classification pipeline built from commonly used machine learning methods. This application is motivated by the need to test bonobos in their enclosure using touch screen devices without human assistance. This work introduces a newly acquired dataset based on bonobo recordings generated semi-automatically. The recordings are weakly labelled and fed to…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: IntelliSys 2023 paper

  11. arXiv:2308.02534  [pdf, other]

    cs.CV cs.AI

    Exploring the Role of Explainability in AI-Assisted Embryo Selection

    Authors: Lucia Urcelay, Daniel Hinjos, Pablo A. Martin-Torres, Marta Gonzalez, Marta Mendez, Salva Cívico, Sergio Álvarez-Napagao, Dario Garcia-Gasulla

    Abstract: In Vitro Fertilization is among the most widespread treatments for infertility. One of its main challenges is the evaluation and selection of embryos for implantation, a process with large inter- and intra-clinician variability. Deep learning based methods are gaining attention, but their opaque nature compromises their acceptance in the clinical context, where transparency in the decision making i…

    Submitted 1 August, 2023; originally announced August 2023.

  12. arXiv:2306.11840  [pdf, ps, other]

    cs.DC

    A C++20 Interface for MPI 4.0

    Authors: Ali Can Demiralp, Philipp Martin, Niko Sakic, Marcel Krüger, Tim Gerrits

    Abstract: We present a modern C++20 interface for MPI 4.0. The interface utilizes recent language features to ease development of MPI applications. An aggregate reflection system enables generation of MPI data types from user-defined classes automatically. Immediate and persistent operations are mapped to futures, which can be chained to describe sequential asynchronous operations and task graphs in a conci…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: To appear in SC '22: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

  13. arXiv:2306.02568  [pdf, other]

    stat.ML cs.LG

    Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming

    Authors: Xinlei Niu, Christian Walder, Jing Zhang, Charles Patrick Martin

    Abstract: We propose the stochastic optimal path which solves the classical optimal path problem by a probability-softening solution. This unified approach transforms a wide range of DP problems into directed acyclic graphs in which all paths follow a Gibbs distribution. We show the equivalence of the Gibbs distribution to a message-passing algorithm by the properties of the Gumbel distribution and give all…

    Submitted 25 June, 2024; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML 2024

  14. arXiv:2305.03176  [pdf]

    cs.MM

    NeRF-QA: Neural Radiance Fields Quality Assessment Database

    Authors: Pedro Martin, António Rodrigues, João Ascenso, Maria Paula Queluz

    Abstract: This short paper proposes a new database, NeRF-QA, containing 48 videos synthesized with seven NeRF-based methods, along with their perceived quality scores resulting from subjective assessment tests; for the video selection, both real and synthetic 360-degree scenes were considered. This database will allow evaluating the suitability, for NeRF-based synthesized views, of existing objective…

    Submitted 4 May, 2023; originally announced May 2023.

  15. arXiv:2303.16960  [pdf, ps, other]

    math.PR cs.DM math.CO

    Boltzmann Distribution on "Short" Integer Partitions with Power Parts: Limit Laws and Sampling

    Authors: Jean C. Peyen, Leonid V. Bogachev, Paul P. Martin

    Abstract: The paper is concerned with the asymptotic analysis of a family of Boltzmann (multiplicative) distributions over the set $\check{\varLambda}^{q}$ of strict integer partitions (i.e., with unequal parts) into perfect $q$-th powers. A combinatorial link is provided via a suitable conditioning by fixing the partition weight (the sum of parts) and length (the number of parts), leading to uniform distri…

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: 62 pages, 5 figures, 4 tables

    MSC Class: 05A17 (Primary); 05A16; 60C05; 68Q87; 82B10 (Secondary)

  16. arXiv:2302.02755  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Fine-Grained Action Detection with RGB and Pose Information using Two Stream Convolutional Networks

    Authors: Leonard Hacker, Finn Bartels, Pierre-Etienne Martin

    Abstract: As participants of the MediaEval 2022 Sport Task, we propose a two-stream network approach for the classification and detection of table tennis strokes. Each stream is a succession of 3D Convolutional Neural Network (CNN) blocks using attention mechanisms. Each stream processes different 4D inputs. Our method utilizes raw RGB data and pose information computed from the MMPose toolbox. The pose informa…

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Working note paper of the sport task of MediaEval 2022 in Bergen, Norway, 12-13 Jan 2023

  17. arXiv:2302.02752  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms

    Authors: Pierre-Etienne Martin

    Abstract: This paper presents the baseline method proposed for the Sports Video task part of the MediaEval 2022 benchmark. This task proposes two subtasks: stroke classification from trimmed videos, and stroke detection from untrimmed videos. This baseline addresses both subtasks. We propose two types of 3D-CNN architectures to solve the two subtasks. Both 3D-CNNs use spatio-temporal convolutions and attent…

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Baseline paper for the sport Task of MediaEval 2022

  18. arXiv:2302.00129  [pdf, other]

    cs.CL q-bio.NC

    Universal Topological Regularities of Syntactic Structures: Decoupling Efficiency from Optimization

    Authors: Fermín Moscoso del Prado Martín

    Abstract: Human syntactic structures are usually represented as graphs. Much research has focused on the mapping between such graphs and linguistic sequences, but less attention has been paid to the shapes of the graphs themselves: their topologies. This study investigates how the topologies of syntactic graphs reveal traces of the processes that led to their emergence. I report a new universal regularity i…

    Submitted 31 January, 2023; originally announced February 2023.

    Comments: 30 pages, 7 figures

  19. arXiv:2301.13576  [pdf, other]

    cs.AI cs.CV cs.HC cs.LG cs.MM

    Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022

    Authors: Pierre-Etienne Martin, Jordan Calandre, Boris Mansencal, Jenny Benois-Pineau, Renaud Péteri, Laurent Mascarilla, Julien Morlier

    Abstract: Sports video analysis is a widespread research topic. Its applications are diverse, including event detection during a match, video summarization, and fine-grained movement analysis of athletes. As part of the MediaEval 2022 benchmarking initiative, this task aims at detecting and classifying subtle movements from sport videos. We focus on recordings of table tennis matches. Conducted since 2019, this t…

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: MediaEval 2022 Workshop, Jan 2023, Bergen, Norway. arXiv admin note: substantial text overlap with arXiv:2112.11384

  20. arXiv:2212.08484  [pdf, other]

    cs.NE cs.MA

    Emergent communication enhances foraging behaviour in evolved swarms controlled by Spiking Neural Networks

    Authors: Cristian Jimenez Romero, Alper Yegenoglu, Aarón Pérez Martín, Sandra Diaz-Pier, Abigail Morrison

    Abstract: Social insects such as ants communicate via pheromones, which allows them to coordinate their activity and solve complex tasks as a swarm, e.g. foraging for food. This behavior was shaped through evolutionary processes. In computational models, self-coordination in swarms has been implemented using probabilistic or simple action rules to shape the decision of each agent and the collective behavior.…

    Submitted 8 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

    Comments: 27 pages, 16 figures

  21. arXiv:2210.09291  [pdf, other]

    cs.HC

    Embodying the Glitch: Perspectives on Generative AI in Dance Practice

    Authors: Benedikte Wallace, Charles P. Martin

    Abstract: What role does the break from realism play in the potential for generative artificial intelligence as a creative tool? Through exploration of glitch, we examine the prospective value of these artefacts in creative practice. This paper describes findings from an exploration of AI-generated "mistakes" when using movement produced by a generative deep learning model as an inspiration source in dance…

    Submitted 5 October, 2022; originally announced October 2022.

  22. arXiv:2209.14030  [pdf, other]

    cs.RO cs.CL cs.FL

    Monitoring ROS2: from Requirements to Autonomous Robots

    Authors: Ivan Perez, Anastasia Mavridou, Tom Pressburger, Alexander Will, Patrick J. Martin

    Abstract: Runtime verification (RV) has the potential to enable the safe operation of safety-critical systems that are too complex to formally verify, such as Robot Operating System 2 (ROS2) applications. Writing correct monitors can itself be complex, and errors in the monitoring subsystem threaten the mission as a whole. This paper provides an overview of a formal approach to generating runtime monitors f…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: In Proceedings FMAS2022 ASYDE2022, arXiv:2209.13181

    ACM Class: D.2.1; D.2.4; I.2.9;

    Journal ref: EPTCS 371, 2022, pp. 208-216

  23. arXiv:2208.02758  [pdf, other]

    cs.LG cs.MA math.DS math.NA

    Learning Interaction Variables and Kernels from Observations of Agent-Based Systems

    Authors: Jinchao Feng, Mauro Maggioni, Patrick Martin, Ming Zhong

    Abstract: Dynamical systems across many disciplines are modeled as interacting particles or agents, with interaction rules that depend on a very small number of variables (e.g. pairwise distances, pairwise differences of phases), functions of the state of pairs of agents. Yet, these interaction rules can generate self-organized dynamics, with complex emergent behaviors (clustering, flocking, swarmin…

    Submitted 4 August, 2022; originally announced August 2022.

  24. arXiv:2204.08460  [pdf, other]

    cs.CV cs.LG eess.IV

    3D Convolutional Networks for Action Recognition: Application to Sport Gesture Recognition

    Authors: Pierre-Etienne Martin, J Benois-Pineau, R Péteri, A Zemmari, J Morlier

    Abstract: 3D convolutional networks are a good means to perform tasks such as video segmentation into coherent spatio-temporal chunks and their classification with regard to a target taxonomy. In this chapter we are interested in the classification of continuous video takes with repeatable actions, such as strokes of table tennis. Filmed in a free markerless ecological environment, these videos represent a…

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: Multi-faceted Deep Learning, 2021

  25. Exploring hyper-parameter spaces of neuroscience models on high performance computers with Learning to Learn

    Authors: Alper Yegenoglu, Anand Subramoney, Thorsten Hater, Cristian Jimenez-Romero, Wouter Klijn, Aaron Perez Martin, Michiel van der Vlag, Michael Herty, Abigail Morrison, Sandra Diaz-Pier

    Abstract: Neuroscience models commonly have a high number of degrees of freedom and only specific regions within the parameter space are able to produce dynamics of interest. This makes the development of tools and strategies to efficiently find these regions highly important for advancing brain research. Exploring the high dimensional parameter space using numerical simulations has been a frequently used te…

    Submitted 28 February, 2022; originally announced February 2022.

  26. arXiv:2202.09977  [pdf, other]

    cs.LG

    RTGNN: A Novel Approach to Model Stochastic Traffic Dynamics

    Authors: Ke Sun, Stephen Chaves, Paul Martin, Vijay Kumar

    Abstract: Modeling stochastic traffic dynamics is critical to developing self-driving cars. Because it is difficult to develop first principle models of cars driven by humans, there is great potential for using data driven approaches in developing traffic dynamical models. While there is extensive literature on this subject, previous works mainly address the prediction accuracy of data-driven models. Moreov…

    Submitted 20 February, 2022; originally announced February 2022.

    Comments: Accepted by ICRA 2022

  27. arXiv:2112.12074  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark

    Authors: Pierre-Etienne Martin

    Abstract: This paper presents the baseline method proposed for the Sports Video task part of the MediaEval 2021 benchmark. This task proposes stroke detection and stroke classification subtasks. This baseline addresses both subtasks. The spatio-temporal CNN architecture and the training process of the model are tailored according to the addressed subtask. The method has the purpose of helping the partic…

    Submitted 16 December, 2021; originally announced December 2021.

    Journal ref: MediaEval 2021, Dec 2021, Online, Germany

  28. arXiv:2112.12073  [pdf, other]

    cs.CV cs.LG cs.MM

    Two Stream Network for Stroke Detection in Table Tennis

    Authors: Anam Zahra, Pierre-Etienne Martin

    Abstract: This paper presents a table tennis stroke detection method from videos. The method relies on a two-stream Convolutional Neural Network processing in parallel the RGB stream and its computed optical flow. The method has been developed as part of the MediaEval 2021 benchmark for the Sport task. Our contribution did not outperform the provided baseline on the test set but performed the best among…

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: MediaEval 2021, Dec 2021, Online, Germany

  29. arXiv:2112.11384  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021

    Authors: Pierre-Etienne Martin, Jordan Calandre, Boris Mansencal, Jenny Benois-Pineau, Renaud Péteri, Laurent Mascarilla, Julien Morlier

    Abstract: Sports video analysis is a prevalent research topic due to the variety of application areas, ranging from multimedia intelligent devices with user-tailored digests up to analysis of athletes' performance. The Sports Video task is part of the MediaEval 2021 benchmark. This task tackles fine-grained action detection and classification from videos. The focus is on recordings of table tennis games. Ru…

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: MediaEval 2021, Dec 2021, Online, Germany

  30. arXiv:2110.14690  [pdf, other]

    stat.ML cs.LG

    VACA: Design of Variational Graph Autoencoders for Interventional and Counterfactual Queries

    Authors: Pablo Sanchez-Martin, Miriam Rateike, Isabel Valera

    Abstract: In this paper, we introduce VACA, a novel class of variational graph autoencoders for causal inference in the absence of hidden confounders, when only observational data and the causal graph are available. Without making any parametric assumptions, VACA mimics the necessary properties of a Structural Causal Model (SCM) to provide a flexible and practical framework for approximating interventions (…

    Submitted 27 October, 2021; originally announced October 2021.

  31. arXiv:2109.14306  [pdf, other]

    cs.CV cs.AI cs.HC cs.LG cs.MM

    Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis

    Authors: Pierre-Etienne Martin, Jenny Benois-Pineau, Renaud Péteri, Julien Morlier

    Abstract: This paper proposes a fusion method of modalities extracted from video through a three-stream network with spatio-temporal and temporal convolutions for fine-grained action classification in sport. It is applied to the TTStroke-21 dataset, which consists of untrimmed videos of table tennis games. The goal is to detect and classify table tennis strokes in the videos, the first step of a bigger scheme ai…

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: MMSports '21, October 20, 2021, Virtual Event, Oct 2021, Chengdu, China

  32. arXiv:2107.13386  [pdf, other]

    cs.AR

    SPOTS: An Accelerator for Sparse Convolutional Networks Leveraging Systolic General Matrix-Matrix Multiplication

    Authors: Mohammadreza Soltaniyeh, Richard P. Martin, Santosh Nagarakatte

    Abstract: This paper proposes a new hardware accelerator for sparse convolutional neural networks (CNNs) by building a hardware unit to perform the Image to Column (IM2COL) transformation of the input feature map coupled with a systolic array-based general matrix-matrix multiplication (GEMM) unit. Our design carefully overlaps the IM2COL transformation with the GEMM computation to maximize parallelism. We p…

    Submitted 24 November, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: 24 pages

    Report number: Rutgers Department of Computer Science Technical Report DCS-TR-756

  33. arXiv:2105.06166  [pdf, ps, other]

    cs.DS

    The Dynamic k-Mismatch Problem

    Authors: Raphaël Clifford, Paweł Gawrychowski, Tomasz Kociumaka, Daniel P. Martin, Przemysław Uznański

    Abstract: The text-to-pattern Hamming distances problem asks to compute the Hamming distances between a given pattern of length $m$ and all length-$m$ substrings of a given text of length $n\ge m$. We focus on the $k$-mismatch version of the problem, where a distance needs to be returned only if it does not exceed a threshold $k$. We assume $n\le 2m$ (in general, one can partition the text into overlapping…

    Submitted 28 March, 2022; v1 submitted 13 May, 2021; originally announced May 2021.

  34. arXiv:2012.05342  [pdf, other]

    cs.CV cs.HC cs.LG cs.MM

    3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks

    Authors: Pierre-Etienne Martin, Jenny Benois-Pineau, Renaud Péteri, Julien Morlier

    Abstract: The paper addresses the problem of recognition of actions in video with low inter-class variability such as Table Tennis strokes. Two stream, "twin" convolutional neural networks are used with 3D convolutions both on RGB data and optical flow. Actions are recognized by classification of temporal windows. We introduce 3D attention modules and examine their impact on classification efficiency. In th…

    Submitted 20 November, 2020; originally announced December 2020.

    Journal ref: 25th International Conference on Pattern Recognition (ICPR2020), Jan 2021, Milano, Italy

  35. Composing an Ensemble Standstill Work for Myo and Bela

    Authors: Charles Patrick Martin, Alexander Refsum Jensenius, Jim Torresen

    Abstract: This paper describes the process of developing a standstill performance work using the Myo gesture control armband and the Bela embedded computing platform. The combination of Myo and Bela allows a portable and extensible version of the standstill performance concept while introducing muscle tension as an additional control parameter. We describe the technical details of our setup and introduce My…

    Submitted 4 December, 2020; originally announced December 2020.

    ACM Class: H.5.5

    Journal ref: Proceedings of the International Conference on New Interfaces for Musical Expression, 2018, pp. 196-197

  36. arXiv:2012.02322  [pdf, other]

    cs.HC cs.SD eess.AS

    A Laptop Ensemble Performance System using Recurrent Neural Networks

    Authors: Rohan Proctor, Charles Patrick Martin

    Abstract: The popularity of applying machine learning techniques in musical domains has created an inherent availability of freely accessible pre-trained neural network (NN) models ready for use in creative applications. This work outlines the implementation of one such application in the form of an assistance tool designed for live improvisational performances by laptop ensembles. The primary intention was…

    Submitted 3 December, 2020; originally announced December 2020.

    ACM Class: H.5.5; H.5.3

    Journal ref: Proceedings of the International Conference on New Interfaces for Musical Expression, 2020, pp. 43-48

  37. arXiv:2012.02311  [pdf, other]

    cs.HC cs.SD eess.AS

    Sonic Sculpture: Activating Engagement with Head-Mounted Augmented Reality

    Authors: Charles Patrick Martin, Zeruo Liu, Yichen Wang, Wennan He, Henry Gardner

    Abstract: This work examines how head-mounted AR can be used to build an interactive sonic landscape to engage with a public sculpture. We describe a sonic artwork, "Listening To Listening", that has been designed to accompany a real-world sculpture with two prototype interaction schemes. Our artwork is created for the HoloLens platform so that users can have an individual experience in a mixed reality cont…

    Submitted 3 December, 2020; originally announced December 2020.

    ACM Class: H.5.5; H.5.1

    Journal ref: Proceedings of the International Conference on New Interfaces for Musical Expression, 2020, pp. 48-52

  38. arXiv:2011.13453  [pdf, other]

    cs.SD eess.AS

    Towards Movement Generation with Audio Features

    Authors: Benedikte Wallace, Charles P. Martin, Jim Torresen, Kristian Nymoen

    Abstract: Sound and movement are closely coupled, particularly in dance. Certain audio features have been found to affect the way we move to music. Is this relationship between sound and movement something which can be modelled using machine learning? This work presents initial experiments wherein high-level audio features calculated from a set of music pieces are included in a movement generation model tra…

    Submitted 26 November, 2020; originally announced November 2020.

  39. arXiv:2010.02663  [pdf, other]

    cs.MA cs.AI

    Heterogeneous Multi-Agent Reinforcement Learning for Unknown Environment Mapping

    Authors: Ceyer Wakilpoor, Patrick J. Martin, Carrie Rebhuhn, Amanda Vu

    Abstract: Reinforcement learning in heterogeneous multi-agent scenarios is important for real-world applications but presents challenges beyond those seen in homogeneous settings and simple benchmarks. In this work, we present an actor-critic algorithm that allows a team of heterogeneous agents to learn decentralized control policies for covering an unknown environment. This task is of interest to national…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Presented at AAAI FSS-20: Artificial Intelligence in Government and Public Sector, Washington, DC, USA. 8 pages, 6 figures

  40. arXiv:2007.05794  [pdf, other]

    cs.RO

    Feedback Enhanced Motion Planning for Autonomous Vehicles

    Authors: Ke Sun, Brent Schlotfeldt, Stephen Chaves, Paul Martin, Gulshan Mandhyan, Vijay Kumar

    Abstract: In this work, we address the motion planning problem for autonomous vehicles through a new lattice planning approach, called Feedback Enhanced Lattice Planner (FELP). Existing lattice planners have two major limitations, namely the high dimensionality of the lattice and the lack of modeling of agent vehicle behaviors. We propose to apply the Intelligent Driver Model (IDM) as a speed feedback polic…

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: To appear in IROS 2020

  41. arXiv:2007.05149  [pdf, other]

    eess.IV cs.CV cs.LG

    Localized Motion Artifact Reduction on Brain MRI Using Deep Learning with Effective Data Augmentation Techniques

    Authors: Yijun Zhao, Jacek Ossowski, Xuming Wang, Shangjin Li, Orrin Devinsky, Samantha P. Martin, Heath R. Pardoe

    Abstract: In-scanner motion degrades the quality of magnetic resonance imaging (MRI), thereby reducing its utility in the detection of clinically relevant abnormalities. We introduce a deep learning-based MRI artifact reduction model (DMAR) to localize and correct head motion artifacts in brain MRI scans. Our approach integrates the latest advances in object detection and noise reduction in Computer Vision.…

    Submitted 30 October, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: 11 pages, 8 figures

  42. arXiv:2005.14645  [pdf, other]

    cs.CR

    DatashareNetwork: A Decentralized Privacy-Preserving Search Engine for Investigative Journalists

    Authors: Kasra EdalatNejad, Wouter Lueks, Julien Pierre Martin, Soline Ledésert, Anne L'Hôte, Bruno Thomas, Laurent Girod, Carmela Troncoso

    Abstract: Investigative journalists collect large numbers of digital documents during their investigations. These documents can greatly benefit other journalists' work. However, many of these documents contain sensitive information. Hence, possessing such documents can endanger reporters, their stories, and their sources. Consequently, many documents are used only for single, local, investigations. We pre…

    Submitted 30 July, 2020; v1 submitted 29 May, 2020; originally announced May 2020.

    Journal ref: USENIX Security Symposium 2020: 1911-1927

  43. arXiv:2004.13907  [pdf, other]

    cs.DC cs.MS cs.PL

    Synergistic CPU-FPGA Acceleration of Sparse Linear Algebra

    Authors: Mohammadreza Soltaniyeh, Richard P. Martin, Santosh Nagarakatte

    Abstract: This paper describes REAP, a software-hardware approach that enables high performance sparse linear algebra computations on a cooperative CPU-FPGA platform. REAP carefully separates the task of organizing the matrix elements from the computation phase. It uses the CPU to provide a first-pass re-organization of the matrix elements, allowing the FPGA to focus on the computation. We introduce a new i…

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: 12 pages

    Report number: Rutgers Computer Science Technical Report DCS-TR-750

  44. arXiv:2003.13254  [pdf, other]

    cs.RO cs.NE

    Environmental Adaptation of Robot Morphology and Control through Real-world Evolution

    Authors: Tønnes F. Nygaard, Charles P. Martin, David Howard, Jim Torresen, Kyrre Glette

    Abstract: Robots operating in the real world will experience a range of different environments and tasks. It is essential for the robot to have the ability to adapt to its surroundings to work efficiently in changing conditions. Evolutionary robotics aims to solve this by optimizing both the control and body (morphology) of a robot, allowing adaptation to internal, as well as external factors. Most work in…

    Submitted 20 October, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

  45. Towards a Framework for the Design, Implementation and Reporting of Methodology Scoping Reviews

    Authors: Glen P. Martin, David Jenkins, Lucy Bull, Rose Sisk, Lijing Lin, William Hulme, Anthony Wilson, Wenjuan Wang, Michael Barrowman, Camilla Sammut-Powell, Alexander Pate, Matthew Sperrin, Niels Peek

    Abstract: Background: In view of the growth of published papers, there is an increasing need for studies that summarise scientific research. An increasingly common review is a 'Methodology scoping review', which provides a summary of existing analytical methods, techniques and software, proposed or applied in research articles, which address an analytical problem or further an analytical approach. However,…

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: 22 pages, 2 tables

    Journal ref: Journal of Clinical Epidemiology. (2020)

  46. arXiv:1911.01425  [pdf, other]

    stat.ML cs.CV cs.LG

    Improved BiGAN training with marginal likelihood equalization

    Authors: Pablo Sánchez-Martín, Pablo M. Olmos, Fernando Perez-Cruz

    Abstract: We propose a novel training procedure for improving the performance of generative adversarial networks (GANs), especially bidirectional GANs. First, we enforce that the empirical distribution of the inverse inference network matches the prior distribution, which favors the generator network reproducibility on the seen samples. Second, we have found that the marginal log-likelihood of the sample…

    Submitted 23 May, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

  47. arXiv:1906.08362  [pdf, other]

    cs.AI

    Trepan Reloaded: A Knowledge-driven Approach to Explaining Artificial Neural Networks

    Authors: Roberto Confalonieri, Tillman Weyde, Tarek R. Besold, Fermín Moscoso del Prado Martín

    Abstract: Explainability in Artificial Intelligence has been revived as a topic of active research by the need of conveying safety and trust to users in the 'how' and 'why' of automated decision-making. Whilst a plethora of approaches have been developed for post-hoc explainability, only a few focus on how to use domain knowledge, and how this influences the understandability of global explanations from the…

    Submitted 21 November, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

  48. arXiv:1905.05626  [pdf, other]

    cs.RO

    Lessons Learned from Real-World Experiments with DyRET: the Dynamic Robot for Embodied Testing

    Authors: Tønnes F. Nygaard, Jørgen Nordmoen, Charles P. Martin, Kyrre Glette

    Abstract: Robots are used in more and more complex environments, and are expected to be able to adapt to changes and unknown situations. The easiest and quickest way to adapt is to change the control system of the robot, but for increasingly complex environments one should also change the body of the robot -- its morphology -- to better fit the task at hand. The theory of Embodied Cognition states that cont…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Accepted to the Learning Legged Locomotion Workshop @ ICRA 2019

  49. arXiv:1905.01254  [pdf, ps, other]

    cs.DS

    RLE edit distance in near optimal time

    Authors: Raphaël Clifford, Paweł Gawrychowski, Tomasz Kociumaka, Daniel P. Martin, Przemysław Uznański

    Abstract: We show that the edit distance between two run-length encoded strings of compressed lengths $m$ and $n$ respectively, can be computed in $\mathcal{O}(mn\log(mn))$ time. This improves the previous record by a factor of $\mathcal{O}(n/\log(mn))$. The running time of our algorithm is within subpolynomial factors of being optimal, subject to the standard SETH-hardness assumption. This effectively clos…

    Submitted 3 May, 2019; originally announced May 2019.

  50. Recommending research articles to consumers of online vaccination information

    Authors: Eliza Harrison, Paige Martin, Didi Surian, Adam G. Dunn

    Abstract: Online health communications often provide biased interpretations of evidence and have unreliable links to the source research. We tested the feasibility of a tool for matching webpages to their source evidence. From 207,538 eligible vaccination-related PubMed articles, we evaluated several approaches using 3,573 unique links to webpages from Altmetric. We evaluated methods for ranking the source…

    Submitted 19 August, 2020; v1 submitted 26 April, 2019; originally announced April 2019.

    Comments: 12 pages, 5 figures, 2 tables

    ACM Class: H.3.3

    Journal ref: Quantitative Science Studies, 1(2):810-823 (2020)