Skip to main content

Showing 1–50 of 86 results for author: Romero, A

  1. arXiv:2407.01705  [pdf

    cs.CV cs.AI

    Optimized Learning for X-Ray Image Classification for Multi-Class Disease Diagnoses with Accelerated Computing Strategies

    Authors: Sebastian A. Cruz Romero, Ivanelyz Rivera de Jesus, Dariana J. Troche Quinones, Wilson Rivera Gallego

    Abstract: X-ray image-based disease diagnosis lies in ensuring the precision of identifying afflictions within the sample, a task fraught with challenges stemming from the occurrence of false positives and false negatives. False positives introduce the risk of erroneously identifying non-existent conditions, leading to misdiagnosis and a decline in patient care quality. Conversely, false negatives pose the… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: High Performance Computing course final term paper

  2. arXiv:2406.12505  [pdf, other

    cs.RO

    Demonstrating Agile Flight from Pixels without State Estimation

    Authors: Ismail Geles, Leonard Bauersfeld, Angel Romero, Jiaxu Xing, Davide Scaramuzza

    Abstract: Quadrotors are among the most agile flying robots. Despite recent advances in learning-based control and computer vision, autonomous drones still rely on explicit state estimation. On the other hand, human pilots only rely on a first-person-view video stream from the drone onboard camera to push the platform to its limits and fly robustly in unseen environments. To the best of our knowledge, we pr… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Journal ref: Robotics: Science and Systems (RSS), 2024

  3. arXiv:2404.15029  [pdf, other

    cs.LG

    Explainable LightGBM Approach for Predicting Myocardial Infarction Mortality

    Authors: Ana Letícia Garcez Vicente, Roseval Donisete Malaquias Junior, Roseli A. F. Romero

    Abstract: Myocardial Infarction is a main cause of mortality globally, and accurate risk prediction is crucial for improving patient outcomes. Machine Learning techniques have shown promise in identifying high-risk patients and predicting outcomes. However, patient data often contain vast amounts of information and missing values, posing challenges for feature selection and imputation methods. In this artic… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: This article has been accepted at the 2023 International Conference on Computational Science and Computational Intelligence (CSCI 23)

  4. arXiv:2403.17551  [pdf, other

    cs.RO

    MPCC++: Model Predictive Contouring Control for Time-Optimal Flight with Safety Constraints

    Authors: Maria Krinner, Angel Romero, Leonard Bauersfeld, Melanie Zeilinger, Andrea Carron, Davide Scaramuzza

    Abstract: Quadrotor flight is an extremely challenging problem due to the limited control authority encountered at the limit of handling. Model Predictive Contouring Control (MPCC) has emerged as a promising model-based approach for time optimization problems such as drone racing. However, the standard MPCC formulation used in quadrotor racing introduces the notion of the gates directly in the cost function… ▽ More

    Submitted 14 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, 6 figures

    Journal ref: Robotics: Science and Systems (RSS), 2024

  5. arXiv:2403.12203  [pdf, other

    cs.RO cs.CV cs.LG

    Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight

    Authors: Jiaxu Xing, Angel Romero, Leonard Bauersfeld, Davide Scaramuzza

    Abstract: We combine the effectiveness of Reinforcement Learning (RL) and the efficiency of Imitation Learning (IL) in the context of vision-based, autonomous drone racing. We focus on directly processing visual input without explicit state estimation. While RL offers a general framework for learning complex controllers through trial and error, it faces challenges regarding sample efficiency and computation… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  6. arXiv:2403.09177  [pdf, other

    cs.RO cs.NI

    Cellular-enabled Collaborative Robots Planning and Operations for Search-and-Rescue Scenarios

    Authors: Arnau Romero, Carmen Delgado, Lanfranco Zanzi, Raúl Suárez, Xavier Costa-Pérez

    Abstract: Mission-critical operations, particularly in the context of Search-and-Rescue (SAR) and emergency response situations, demand optimal performance and efficiency from every component involved to maximize the success probability of such operations. In these settings, cellular-enabled collaborative robotic systems have emerged as invaluable assets, assisting first responders in several tasks, ranging… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  7. arXiv:2403.02514  [pdf, other

    cs.RO cs.AI cs.LG

    Purpose for Open-Ended Learning Robots: A Computational Taxonomy, Definition, and Operationalisation

    Authors: Gianluca Baldassarre, Richard J. Duro, Emilio Cartoni, Mehdi Khamassi, Alejandro Romero, Vieri Giuliano Santucci

    Abstract: Autonomous open-ended learning (OEL) robots are able to cumulatively acquire new skills and knowledge through direct interaction with the environment, for example relying on the guidance of intrinsic motivations and self-generated goals. OEL robots have a high relevance for applications as they can use the autonomously acquired knowledge to accomplish tasks relevant for their human users. OEL robo… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 15 pages, 6 figures

  8. arXiv:2311.17068  [pdf, other

    cs.CE cs.LG

    Deep convolutional encoder-decoder hierarchical neural networks for conjugate heat transfer surrogate modeling

    Authors: Takiah Ebbs-Picken, David A. Romero, Carlos M. Da Silva, Cristina H. Amon

    Abstract: Conjugate heat transfer (CHT) models are vital for the design of many engineering systems. However, high-fidelity CHT models are computationally intensive, which limits their use in applications such as design optimization, where hundreds to thousands of model evaluations are required. In this work, we develop a modular deep convolutional encoder-decoder hierarchical (DeepEDH) neural network, a no… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  9. Reaching the Limit in Autonomous Racing: Optimal Control versus Reinforcement Learning

    Authors: Yunlong Song, Angel Romero, Matthias Mueller, Vladlen Koltun, Davide Scaramuzza

    Abstract: A central question in robotics is how to design a control system for an agile mobile robot. This paper studies this question systematically, focusing on a challenging setting: autonomous drone racing. We show that a neural network controller trained with reinforcement learning (RL) outperformed optimal control (OC) methods in this setting. We then investigated which fundamental factors have contri… ▽ More

    Submitted 18 October, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

    Journal ref: Science Robotics, 2023

  10. arXiv:2309.12784  [pdf, other

    cs.RO

    Learning to Walk and Fly with Adversarial Motion Priors

    Authors: Giuseppe L'Erario, Drew Hanover, Angel Romero, Yunlong Song, Gabriele Nava, Paolo Maria Viceconte, Daniele Pucci, Davide Scaramuzza

    Abstract: Robot multimodal locomotion encompasses the ability to transition between walking and flying, representing a significant challenge in robotics. This work presents an approach that enables automatic smooth transitions between legged and aerial locomotion. Leveraging the concept of Adversarial Motion Priors, our method allows the robot to imitate motion datasets and accomplish the desired task witho… ▽ More

    Submitted 29 March, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 8 pages, 8 figures, submitted to IROS 2024

  11. Agilicious: Open-Source and Open-Hardware Agile Quadrotor for Vision-Based Flight

    Authors: Philipp Foehn, Elia Kaufmann, Angel Romero, Robert Penicka, Sihao Sun, Leonard Bauersfeld, Thomas Laengle, Giovanni Cioffi, Yunlong Song, Antonio Loquercio, Davide Scaramuzza

    Abstract: Autonomous, agile quadrotor flight raises fundamental challenges for robotics research in terms of perception, planning, learning, and control. A versatile and standardized platform is needed to accelerate research and let practitioners focus on the core problems. To this end, we present Agilicious, a co-designed hardware and software framework tailored to autonomous, agile quadrotor flight. It is… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 14 pages, 5 figures, 2 tables

    Journal ref: Science Robotics Vol. 7, Issue 67, 2022

  12. arXiv:2306.09852  [pdf, other

    cs.RO

    Actor-Critic Model Predictive Control

    Authors: Angel Romero, Yunlong Song, Davide Scaramuzza

    Abstract: An open research question in robotics is how to combine the benefits of model-free reinforcement learning (RL) - known for its strong task performance and flexibility in optimizing general reward formulations - with the robustness and online replanning capabilities of model predictive control (MPC). This paper provides an answer by introducing a new framework called Actor-Critic Model Predictive C… ▽ More

    Submitted 12 April, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 6 pages, 5 figures

    Journal ref: IEEE Conference on Robotics and Automation (ICRA 2024)

  13. arXiv:2305.07128  [pdf, other

    physics.optics cs.CV

    Pixel-wise rational model for structured light system

    Authors: Raúl Vargas, Lenny A. Romero, Song Zhang, Andres G. Marrugo

    Abstract: This Letter presents a novel structured light system model that effectively considers local lens distortion by pixel-wise rational functions. We leverage the stereo method for initial calibration and then estimate the rational model for each pixel. Our proposed model can achieve high measurement accuracy within and outside the calibration volume, demonstrating its robustness and accuracy.

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 4 pages, 5 figures

    Journal ref: Optics Letters, Vol. 48, No. 10, 2023

  14. arXiv:2303.10239  [pdf, other

    cond-mat.other cs.DL cs.IR physics.soc-ph

    Topic Modeling in Density Functional Theory on Citations of Condensed Matter Electronic Structure Packages

    Authors: Marie Dumaz, Camila Romero-Bohorquez, Donald Adjeroh, Aldo H. Romero

    Abstract: With an increasing number of new scientific papers being released, it becomes harder for researchers to be aware of recent articles in their field of study. Accurately classifying papers is a first step in the direction of personalized catering and easy access to research of interest. The field of Density Functional Theory (DFT) in particular is a good example of a methodology used in very differe… ▽ More

    Submitted 16 February, 2023; originally announced March 2023.

  15. Autonomous Drone Racing: A Survey

    Authors: Drew Hanover, Antonio Loquercio, Leonard Bauersfeld, Angel Romero, Robert Penicka, Yunlong Song, Giovanni Cioffi, Elia Kaufmann, Davide Scaramuzza

    Abstract: Over the last decade, the use of autonomous drone systems for surveying, search and rescue, or last-mile delivery has increased exponentially. With the rise of these applications comes the need for highly robust, safety-critical algorithms which can operate drones in complex and uncertain environments. Additionally, flying fast enables drones to cover more ground which in turn increases productivi… ▽ More

    Submitted 8 July, 2024; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: 26 pages

    Journal ref: IEEE Transactions on Robotics (T-RO), Vol. 40, 2024

  16. Cracking Double-Blind Review: Authorship Attribution with Deep Learning

    Authors: Leonard Bauersfeld, Angel Romero, Manasi Muglikar, Davide Scaramuzza

    Abstract: Double-blind peer review is considered a pillar of academic research because it is perceived to ensure a fair, unbiased, and fact-centered scientific discussion. Yet, experienced researchers can often correctly guess from which research group an anonymous submission originates, biasing the peer-review process. In this work, we present a transformer-based, neural-network architecture that only uses… ▽ More

    Submitted 3 July, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: 13 pages + 3 pages references

    Journal ref: PLOS ONE 18(6): e0287611 (2023)

  17. arXiv:2210.11087  [pdf, other

    cs.RO

    Weighted Maximum Likelihood for Controller Tuning

    Authors: Angel Romero, Shreedhar Govil, Gonca Yilmaz, Yunlong Song, Davide Scaramuzza

    Abstract: Recently, Model Predictive Contouring Control (MPCC) has arisen as the state-of-the-art approach for model-based agile flight. MPCC benefits from great flexibility in trading-off between progress maximization and path following at runtime without relying on globally optimized trajectories. However, finding the optimal set of tuning parameters for MPCC is challenging because (i) the full quadrotor… ▽ More

    Submitted 2 March, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 8 pages

    Journal ref: IEEE Conference on Robotics and Automation (ICRA 2023)

  18. arXiv:2210.07102  [pdf, other

    eess.IV cs.CV cs.LG

    Corneal endothelium assessment in specular microscopy images with Fuchs' dystrophy via deep regression of signed distance maps

    Authors: Juan S. Sierra, Jesus Pineda, Daniela Rueda, Alejandro Tello, Angelica M. Prada, Virgilio Galvis, Giovanni Volpe, Maria S. Millan, Lenny A. Romero, Andres G. Marrugo

    Abstract: Specular microscopy assessment of the human corneal endothelium (CE) in Fuchs' dystrophy is challenging due to the presence of dark image regions called guttae. This paper proposes a UNet-based segmentation approach that requires minimal post-processing and achieves reliable CE morphometric assessment and guttae identification across all degrees of Fuchs' dystrophy. We cast the segmentation proble… ▽ More

    Submitted 29 November, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

  19. arXiv:2205.07562  [pdf, other

    cs.LG cs.AI cs.RO

    Autonomous Open-Ended Learning of Tasks with Non-Stationary Interdependencies

    Authors: Alejandro Romero, Gianluca Baldassarre, Richard J. Duro, Vieri Giuliano Santucci

    Abstract: Autonomous open-ended learning is a relevant approach in machine learning and robotics, allowing the design of artificial agents able to acquire goals and motor skills without the necessity of user assigned tasks. A crucial issue for this approach is to develop strategies to ensure that agents can maximise their competence on as many tasks as possible in the shortest possible time. Intrinsic motiv… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: Submitted and accepted to "The Multi-disciplinary Conference on Reinforcement Learning and Decision Making" RLDM 2022

  20. OROS: Online Operation and Orchestration of Collaborative Robots using 5G

    Authors: Arnau Romero, Carmen Delgado, Lanfranco Zanzi, Xi Li, Xavier Costa-Pérez

    Abstract: The 5G mobile networks extend the capability for supporting collaborative robot operations in outdoor scenarios. However, the restricted battery life of robots still poses a major obstacle to their effective implementation and utilization in real scenarios. One of the most challenging situations is the execution of mission-critical tasks that require the use of various onboard sensors to perform s… ▽ More

    Submitted 1 February, 2024; v1 submitted 6 May, 2022; originally announced May 2022.

    Journal ref: IEEE Transactions on Network and Service Management 2023

  21. Time-Optimal Online Replanning for Agile Quadrotor Flight

    Authors: Angel Romero, Robert Penicka, Davide Scaramuzza

    Abstract: In this paper, we tackle the problem of flying a quadrotor using time-optimal control policies that can be replanned online when the environment changes or when encountering unknown disturbances. This problem is challenging as the time-optimal trajectories that consider the full quadrotor dynamics are computationally expensive to generate (order of minutes or even hours). We introduce a sampling-b… ▽ More

    Submitted 21 July, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: 8 pages, 10 figures

    Journal ref: IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 7730-7737, July 2022

  22. Perception-Aware Perching on Powerlines with Multirotors

    Authors: Julio L. Paneque, Jose Ramiro Martínez de Dios, Aníbal Ollero. Drew Hanover, Sihao Sun, Ángel Romero, Davide Scaramuzza

    Abstract: Multirotor aerial robots are becoming widely used for the inspection of powerlines. To enable continuous, robust inspection without human intervention, the robots must be able to perch on the powerlines to recharge their batteries. Highly versatile perching capabilities are necessary to adapt to the variety of configurations and constraints that are present in real powerline systems. This paper pr… ▽ More

    Submitted 13 February, 2022; originally announced February 2022.

    Comments: IEEE Robotics and Automation Letters (2022)

  23. arXiv:2201.09865  [pdf, other

    cs.CV

    RePaint: Inpainting using Denoising Diffusion Probabilistic Models

    Authors: Andreas Lugmayr, Martin Danelljan, Andres Romero, Fisher Yu, Radu Timofte, Luc Van Gool

    Abstract: Free-form inpainting is the task of adding new content to an image in the regions specified by an arbitrary binary mask. Most existing approaches train for a certain distribution of masks, which limits their generalization capabilities to unseen mask types. Furthermore, training with pixel-wise and perceptual losses often leads to simple textural extensions towards the missing areas instead of sem… ▽ More

    Submitted 31 August, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: We missed out on other diffusion models that work on inpainting. We corrected that and apologize for this mistake

  24. Large-scale Autonomous Flight with Real-time Semantic SLAM under Dense Forest Canopy

    Authors: Xu Liu, Guilherme V. Nardari, Fernando Cladera Ojeda, Yuezhan Tao, Alex Zhou, Thomas Donnelly, Chao Qu, Steven W. Chen, Roseli A. F. Romero, Camillo J. Taylor, Vijay Kumar

    Abstract: Semantic maps represent the environment using a set of semantically meaningful objects. This representation is storage-efficient, less ambiguous, and more informative, thus facilitating large-scale autonomy and the acquisition of actionable information in highly unstructured, GPS-denied environments. In this letter, we propose an integrated system that can perform large-scale autonomous flights an… ▽ More

    Submitted 15 August, 2023; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Xu Liu and Guilherme V. Nardari contributed equally to this work

    Journal ref: IEEE Robotics and Automation Letters ( Volume: 7, Issue: 2, April 2022)

  25. arXiv:2109.01365  [pdf, other

    cs.RO

    A Comparative Study of Nonlinear MPC and Differential-Flatness-Based Control for Quadrotor Agile Flight

    Authors: Sihao Sun, Angel Romero, Philipp Foehn, Elia Kaufmann, Davide Scaramuzza

    Abstract: Accurate trajectory tracking control for quadrotors is essential for safe navigation in cluttered environments. However, this is challenging in agile flights due to nonlinear dynamics, complex aerodynamic effects, and actuation constraints. In this article, we empirically compare two state-of-the-art control frameworks: the nonlinear-model-predictive controller (NMPC) and the differential-flatness… ▽ More

    Submitted 4 January, 2024; v1 submitted 3 September, 2021; originally announced September 2021.

    Journal ref: The paper has been published in the IEEE Transactions on Robotics (T-RO), 2022

  26. arXiv:2108.13205  [pdf, other

    cs.RO

    Model Predictive Contouring Control for Time-Optimal Quadrotor Flight

    Authors: Angel Romero, Sihao Sun, Philipp Foehn, Davide Scaramuzza

    Abstract: We tackle the problem of flying time-optimal trajectories through multiple waypoints with quadrotors. State-of-the-art solutions split the problem into a planning task - where a global, time-optimal trajectory is generated - and a control task - where this trajectory is accurately tracked. However, at the current state, generating a time-optimal trajectory that considers the full quadrotor model r… ▽ More

    Submitted 4 May, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: 17 pages, 16 figures. Video: https://www.youtube.com/watch?v=mHDQcckqdg4 This paper has been accepted for publication in the IEEE Transactions on Robotics (T-RO), 2022

  27. arXiv:2108.11505  [pdf, other

    eess.IV cs.CV cs.LG

    Generalized Real-World Super-Resolution through Adversarial Robustness

    Authors: Angela Castillo, María Escobar, Juan C. Pérez, Andrés Romero, Radu Timofte, Luc Van Gool, Pablo Arbeláez

    Abstract: Real-world Super-Resolution (SR) has been traditionally tackled by first learning a specific degradation model that resembles the noise and corruption artifacts in low-resolution imagery. Thus, current methods lack generalization and lose their accuracy when tested on unseen types of corruption. In contrast to the traditional proposal, we present Robust Super-Resolution (RSR), a method that levera… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: ICCV Workshops, 2021

  28. arXiv:2108.04537  [pdf, other

    cs.RO cs.AI eess.SY

    Time-Optimal Planning for Quadrotor Waypoint Flight

    Authors: Philipp Foehn, Angel Romero, Davide Scaramuzza

    Abstract: Quadrotors are among the most agile flying robots. However, planning time-optimal trajectories at the actuation limit through multiple waypoints remains an open problem. This is crucial for applications such as inspection, delivery, search and rescue, and drone racing. Early works used polynomial trajectory formulations, which do not exploit the full actuator potential because of their inherent sm… ▽ More

    Submitted 1 October, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: Narrated video footage available at https://youtu.be/ZPI8U1uSJUs. Code available at https://github.com/uzh-rpg/rpg_time_optimal. arXiv admin note: text overlap with arXiv:2007.06255

    Journal ref: Published in Science Robotics, 21 Jul 2021, Vol. 6, Issue 56

  29. A Neurorobotics Approach to Behaviour Selection based on Human Activity Recognition

    Authors: Caetano M. Ranieri, Renan C. Moioli, Patricia A. Vargas, Roseli A. F. Romero

    Abstract: Behaviour selection has been an active research topic for robotics, in particular in the field of human-robot interaction. For a robot to interact effectively and autonomously with humans, the coupling between techniques for human activity recognition, based on sensing information, and robot behaviour selection, based on decision-making mechanisms, is of paramount importance. However, most approac… ▽ More

    Submitted 27 September, 2022; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this article is published in Cognitive Neurodynamics (2022), and is available online at https://doi.org/10.1007/s11571-022-09886-z

  30. A Data-Driven Biophysical Computational Model of Parkinson's Disease based on Marmoset Monkeys

    Authors: Caetano M. Ranieri, Jhielson M. Pimentel, Marcelo R. Romano, Leonardo A. Elias, Roseli A. F. Romero, Michael A. Lones, Mariana F. P. Araujo, Patricia A. Vargas, Renan C. Moioli

    Abstract: In this work we propose a new biophysical computational model of brain regions relevant to Parkinson's Disease based on local field potential data collected from the brain of marmoset monkeys. Parkinson's disease is a neurodegenerative disorder, linked to the death of dopaminergic neurons at the substantia nigra pars compacta, which affects the normal dynamics of the basal ganglia-thalamus-cortex… ▽ More

    Submitted 1 September, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Journal ref: IEEE Access, 2021

  31. arXiv:2107.10495  [pdf

    cs.LG

    Benchmarking AutoML Frameworks for Disease Prediction Using Medical Claims

    Authors: Roland Albert A. Romero, Mariefel Nicole Y. Deypalan, Suchit Mehrotra, John Titus Jungao, Natalie E. Sheils, Elisabetta Manduchi, Jason H. Moore

    Abstract: We ascertain and compare the performances of AutoML tools on large, highly imbalanced healthcare datasets. We generated a large dataset using historical administrative claims including demographic information and flags for disease codes in four different time windows prior to 2019. We then trained three AutoML tools on this dataset to predict six different disease outcomes in 2019 and evaluated… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: 22 pages, 8 figures, 7 tables

  32. arXiv:2107.09584  [pdf, other

    cs.CV cs.RO

    Active 3D Shape Reconstruction from Vision and Touch

    Authors: Edward J. Smith, David Meger, Luis Pineda, Roberto Calandra, Jitendra Malik, Adriana Romero, Michal Drozdzal

    Abstract: Humans build 3D understandings of the world through active object exploration, using jointly their senses of vision and touch. However, in 3D shape reconstruction, most recent progress has relied on static datasets of limited sensory data such as RGB images, depth maps or haptic readings, leaving the active exploration of the shape largely unexplored. Inactive touch sensing for 3D reconstruction,… ▽ More

    Submitted 26 October, 2021; v1 submitted 20 July, 2021; originally announced July 2021.

    Journal ref: Published at Neurips 2021

  33. Three-dimensional multimodal medical imaging system based on free-hand ultrasound and structured light

    Authors: Jhacson Meza, Sonia H. Contreras-Ortiz, Lenny A. Romero, Andres G. Marrugo

    Abstract: We propose a three-dimensional (3D) multimodal medical imaging system that combines freehand ultrasound and structured light 3D reconstruction in a single coordinate system without requiring registration. To the best of our knowledge, these techniques have not been combined before as a multimodal imaging technique. The system complements the internal 3D information acquired with ultrasound, with t… ▽ More

    Submitted 29 May, 2021; originally announced May 2021.

    Journal ref: Optical Engineering 60(5), 054106 (2021)

  34. arXiv:2105.08826  [pdf, other

    eess.IV cs.CV cs.LG

    Real-Time Video Super-Resolution on Smartphones with Deep Learning, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Andres Romero, Heewon Kim, Radu Timofte, Chiu Man Ho, Zibo Meng, Kyoung Mu Lee, Yuxiang Chen, Yutong Wang, Zeyu Long, Chenhao Wang, Yifei Chen, Boshen Xu, Shuhang Gu, Lixin Duan, Wen Li, Wang Bofei, Zhang Diankai, Zheng Chengjian, Liu Shaoli, Gao Si, Zhang Xiaofeng, Lu Kaidi, Xu Tianyu, Zheng Hui , et al. (6 additional authors not shown)

    Abstract: Video super-resolution has recently become one of the most important mobile-related problems due to the rise of video communication and streaming services. While many solutions have been proposed for this task, the majority of them are too computationally expensive to run on portable devices with limited hardware resources. To address this problem, we introduce the first Mobile AI challenge, where… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/. arXiv admin note: substantial text overlap with arXiv:2105.07825. substantial text overlap with arXiv:2105.08629, arXiv:2105.07809, arXiv:2105.08630

  35. arXiv:2105.00368  [pdf, other

    cs.CV

    MarkerPose: Robust Real-time Planar Target Tracking for Accurate Stereo Pose Estimation

    Authors: Jhacson Meza, Lenny A. Romero, Andres G. Marrugo

    Abstract: Despite the attention marker-less pose estimation has attracted in recent years, marker-based approaches still provide unbeatable accuracy under controlled environmental conditions. Thus, they are used in many fields such as robotics or biomedical applications but are primarily implemented through classical approaches, which require lots of heuristics and parameter tuning for reliable performance… ▽ More

    Submitted 29 May, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

    Comments: Accepted at CVPR 2021 LXCV Workshop

  36. arXiv:2011.05680  [pdf, other

    cs.CV

    Zero-Pair Image to Image Translation using Domain Conditional Normalization

    Authors: Samarth Shukla, Andrés Romero, Luc Van Gool, Radu Timofte

    Abstract: In this paper, we propose an approach based on domain conditional normalization (DCN) for zero-pair image-to-image translation, i.e., translating between two domains which have no paired training data available but each have paired training data with a third domain. We employ a single generator which has an encoder-decoder structure and analyze different implementations of domain conditional norma… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Paper accepted for publication at WACV 2021

  37. arXiv:2010.11619  [pdf, other

    cs.CV cs.AI

    Self-Supervised Shadow Removal

    Authors: Florin-Alexandru Vasluianu, Andres Romero, Luc Van Gool, Radu Timofte

    Abstract: Shadow removal is an important computer vision task aiming at the detection and successful removal of the shadow produced by an occluded light source and a photo-realistic restoration of the image contents. Decades of re-search produced a multitude of hand-crafted restoration techniques and, more recently, learned solutions from shad-owed and shadow-free training image pairs. In this work,we propo… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: 10 pages, 4 figures, 6 tables

    ACM Class: I.2.10

  38. arXiv:2010.03026  [pdf, other

    cs.CV cs.RO

    Place Recognition in Forests with Urquhart Tessellations

    Authors: Guilherme V. Nardari, Avraham Cohen, Steven W. Chen, Xu Liu, Vaibhav Arcot, Roseli A. F. Romero, Vijay Kumar

    Abstract: In this letter, we present a novel descriptor based on Urquhart tessellations derived from the position of trees in a forest. We propose a framework that uses these descriptors to detect previously seen observations and landmark correspondences, even with partial overlap and noise. We run loop closure detection experiments in simulation and real-world data map-merging from different flights of an… ▽ More

    Submitted 16 November, 2020; v1 submitted 23 September, 2020; originally announced October 2020.

    Comments: 9 pages, 6 Figures

  39. arXiv:2010.02315  [pdf, other

    cs.CV

    SMILE: Semantically-guided Multi-attribute Image and Layout Editing

    Authors: Andrés Romero, Luc Van Gool, Radu Timofte

    Abstract: Attribute image manipulation has been a very active topic since the introduction of Generative Adversarial Networks (GANs). Exploring the disentangled attribute space within a transformation is a very challenging task due to the multiple and mutually-inclusive nature of the facial images, where different labels (eyeglasses, hats, hair, identity, etc.) can co-exist at the same time. Several works a… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

  40. arXiv:2010.01110  [pdf, other

    cs.CV

    AIM 2020 Challenge on Image Extreme Inpainting

    Authors: Evangelos Ntavelis, Andrés Romero, Siavash Bigdeli, Radu Timofte

    Abstract: This paper reviews the AIM 2020 challenge on extreme image inpainting. This report focuses on proposed solutions and results for two different tracks on extreme image inpainting: classical image inpainting and semantically guided image inpainting. The goal of track 1 is to inpaint considerably large part of the image using no supervision but the context. Similarly, the goal of track 2 is to inpain… ▽ More

    Submitted 2 October, 2020; originally announced October 2020.

  41. arXiv:2007.13483  [pdf, other

    cs.LG cs.AI

    Post-Workshop Report on Science meets Engineering in Deep Learning, NeurIPS 2019, Vancouver

    Authors: Levent Sagun, Caglar Gulcehre, Adriana Romero, Negar Rostamzadeh, Stefano Sarao Mannelli

    Abstract: Science meets Engineering in Deep Learning took place in Vancouver as part of the Workshop section of NeurIPS 2019. As organizers of the workshop, we created the following report in an attempt to isolate emerging topics and recurring themes that have been presented throughout the event. Deep learning can still be a complex mix of art and engineering despite its tremendous success in recent years.… ▽ More

    Submitted 29 July, 2020; v1 submitted 25 June, 2020; originally announced July 2020.

    Comments: Report of NeurIPS 2019 workshop SEDL

  42. Active MR k-space Sampling with Reinforcement Learning

    Authors: Luis Pineda, Sumana Basu, Adriana Romero, Roberto Calandra, Michal Drozdzal

    Abstract: Deep learning approaches have recently shown great promise in accelerating magnetic resonance image (MRI) acquisition. The majority of existing work have focused on designing better reconstruction models given a pre-determined acquisition trajectory, ignoring the question of trajectory optimization. In this paper, we focus on learning acquisition trajectories given a fixed image reconstruction mod… ▽ More

    Submitted 7 October, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: Presented at the 23rd International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2020

    Journal ref: LNCS vol. 12262 (2020) 23-33

  43. arXiv:2007.03778  [pdf, other

    cs.CV cs.RO

    3D Shape Reconstruction from Vision and Touch

    Authors: Edward J. Smith, Roberto Calandra, Adriana Romero, Georgia Gkioxari, David Meger, Jitendra Malik, Michal Drozdzal

    Abstract: When a toddler is presented a new toy, their instinctual behaviour is to pick it upand inspect it with their hand and eyes in tandem, clearly searching over its surface to properly understand what they are playing with. At any instance here, touch provides high fidelity localized information while vision provides complementary global context. However, in 3D shape reconstruction, the complementary… ▽ More

    Submitted 2 November, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted at Neurips 2020

  44. arXiv:2004.06502  [pdf, other

    cs.CV cs.LG eess.IV

    Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning

    Authors: Kangning Liu, Shuhang Gu, Andres Romero, Radu Timofte

    Abstract: Existing unsupervised video-to-video translation methods fail to produce translated videos which are frame-wise realistic, semantic information preserving and video-level consistent. In this work, we propose UVIT, a novel unsupervised video-to-video translation model. Our model decomposes the style and the content, uses the specialized encoder-decoder structure and propagates the inter-frame infor… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

  45. arXiv:2004.04977  [pdf, other

    cs.CV cs.GR cs.LG eess.IV

    SESAME: Semantic Editing of Scenes by Adding, Manipulating or Erasing Objects

    Authors: Evangelos Ntavelis, Andrés Romero, Iason Kastanis, Luc Van Gool, Radu Timofte

    Abstract: Recent advances in image generation gave rise to powerful tools for semantic image editing. However, existing approaches can either operate on a single image or require an abundance of additional information. They are not capable of handling the complete set of editing operations, that is addition, manipulation or removal of semantic concepts. To address these limitations, we propose SESAME, a nov… ▽ More

    Submitted 8 October, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

  46. arXiv:2004.04433  [pdf, other

    cs.CV cs.LG eess.IV

    DeepSEE: Deep Disentangled Semantic Explorative Extreme Super-Resolution

    Authors: Marcel C. Bühler, Andrés Romero, Radu Timofte

    Abstract: Super-resolution (SR) is by definition ill-posed. There are infinitely many plausible high-resolution variants for a given low-resolution natural image. Most of the current literature aims at a single deterministic solution of either high reconstruction fidelity or photo-realistic perceptual quality. In this work, we propose an explorative facial super-resolution framework, DeepSEE, for Deep disen… ▽ More

    Submitted 2 October, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: 19 pages. Supplementary material is available on the project page. Accepted for oral presentation at the 15th Asian Conference on Computer Vision (ACCV) 2020

  47. arXiv:2003.04168  [pdf, other

    physics.ins-det cs.CV eess.IV

    Hybrid calibration procedure for fringe projection profilometry based on stereo-vision and polynomial fitting

    Authors: Raul Vargas, Andres G. Marrugo, Song Zhang, Lenny A. Romero

    Abstract: The key to accurate 3D shape measurement in Fringe Projection Profilometry (FPP) is the proper calibration of the measurement system. Current calibration techniques rely on phase-coordinate mapping (PCM) or back-projection stereo-vision (SV) methods. PCM methods are cumbersome to implement as they require precise positioning of the calibration target relative to the FPP system but produce highly a… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

    Comments: Accepted for publication in Applied Optics Vol. 59 No. 13, 2020

    ACM Class: I.4.5; I.4.7

  48. The side effect profile of Clozapine in real world data of three large mental hospitals

    Authors: Ehtesham Iqbal, Risha Govind, Alvin Romero, Olubanke Dzahini, Matthew Broadbent, Robert Stewart, Tanya Smith, Chi-Hun Kim, Nomi Werbeloff, Richard Dobson, Zina Ibrahim

    Abstract: Objective: Mining the data contained within Electronic Health Records (EHRs) can potentially generate a greater understanding of medication effects in the real world, complementing what we know from Randomised control trials (RCTs). We Propose a text mining approach to detect adverse events and medication episodes from the clinical text to enhance our understanding of adverse effects related to Cl… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

  49. arXiv:2001.08311  [pdf, other

    cs.CV

    Learning to adapt class-specific features across domains for semantic segmentation

    Authors: Mikel Menta, Adriana Romero, Joost van de Weijer

    Abstract: Recent advances in unsupervised domain adaptation have shown the effectiveness of adversarial training to adapt features across domains, endowing neural networks with the capability of being tested on a target domain without requiring any training annotations in this domain. The great majority of existing domain adaptation models rely on image translation networks, which often contain a huge amoun… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

    Comments: Master thesis dissertation for the Master in Computer Vision (Barcelona). 11 pages main article and 3 pages appendices. Code available

  50. arXiv:1912.12726  [pdf, other

    cs.RO cs.CV cs.LG

    SLOAM: Semantic Lidar Odometry and Mapping for Forest Inventory

    Authors: Steven W. Chen, Guilherme V. Nardari, Elijah S. Lee, Chao Qu, Xu Liu, Roseli A. F. Romero, Vijay Kumar

    Abstract: This paper describes an end-to-end pipeline for tree diameter estimation based on semantic segmentation and lidar odometry and mapping. Accurate mapping of this type of environment is challenging since the ground and the trees are surrounded by leaves, thorns and vines, and the sensor typically experiences extreme motion. We propose a semantic feature based pose optimization that simultaneously re… ▽ More

    Submitted 29 December, 2019; originally announced December 2019.

    Comments: 8 pages, 5 figures, IEEE Robotics and Automation Letters