Skip to main content

Showing 1–31 of 31 results for author: Micheloni, C

  1. arXiv:2404.09790  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou , et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  2. Tracking Skiers from the Top to the Bottom

    Authors: Matteo Dunnhofer, Luca Sordi, Niki Martinel, Christian Micheloni

    Abstract: Skiing is a popular winter sport discipline with a long history of competitive events. In this domain, computer vision has the potential to enhance the understanding of athletes' performance, but its application lags behind other sports due to limited studies and datasets. This paper makes a step forward in filling such gaps. A thorough investigation is performed on the task of skier tracking in a… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  3. Visualizing Skiers' Trajectories in Monocular Videos

    Authors: Matteo Dunnhofer, Luca Sordi, Christian Micheloni

    Abstract: Trajectories are fundamental to winning in alpine skiing. Tools enabling the analysis of such curves can enhance the training activity and enrich broadcasting content. In this paper, we propose SkiTraVis, an algorithm to visualize the sequence of points traversed by a skier during its performance. SkiTraVis works on monocular videos and constitutes a pipeline of a visual tracker to model the skier… ▽ More

    Submitted 11 April, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), CVsports workshop

  4. arXiv:2302.01144  [pdf, other

    cs.CV cs.LG eess.IV

    UW-CVGAN: UnderWater Image Enhancement with Capsules Vectors Quantization

    Authors: Rita Pucci, Christian Micheloni, Niki Martinel

    Abstract: The degradation in the underwater images is due to wavelength-dependent light attenuation, scattering, and to the diversity of the water types in which they are captured. Deep neural networks take a step in this field, providing autonomous models able to achieve the enhancement of underwater images. We introduce Underwater Capsules Vectors GAN UWCVGAN based on the discrete features quantization pa… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

  5. arXiv:2210.10413  [pdf, other

    cs.CV eess.IV

    Real Image Super-Resolution using GAN through modeling of LR and HR process

    Authors: Rao Muhammad Umer, Christian Micheloni

    Abstract: The current existing deep image super-resolution methods usually assume that a Low Resolution (LR) image is bicubicly downscaled of a High Resolution (HR) image. However, such an ideal bicubic downsampling process is different from the real LR degradations, which usually come from complicated combinations of different degradation processes, such as camera blur, sensor noise, sharpening artifacts,… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted in 18th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), 2022. arXiv admin note: text overlap with arXiv:2009.03693, arXiv:2005.00953

  6. Visual Object Tracking in First Person Vision

    Authors: Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella, Christian Micheloni

    Abstract: The understanding of human-object interactions is fundamental in First Person Vision (FPV). Visual tracking algorithms which follow the objects manipulated by the camera wearer can provide useful information to effectively model such interactions. In the last years, the computer vision community has significantly improved the performance of tracking algorithms for a large variety of target objects… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: International Journal of Computer Vision (IJCV). arXiv admin note: substantial text overlap with arXiv:2108.13665

  7. CoCoLoT: Combining Complementary Trackers in Long-Term Visual Tracking

    Authors: Matteo Dunnhofer, Christian Micheloni

    Abstract: How to combine the complementary capabilities of an ensemble of different algorithms has been of central interest in visual object tracking. A significant progress on such a problem has been achieved, but considering short-term tracking scenarios. Instead, long-term tracking settings have been substantially ignored by the solutions. In this paper, we explicitly consider long-term tracking scenario… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: International Conference on Pattern Recognition (ICPR) 2022

  8. arXiv:2112.09647  [pdf, other

    cs.CV

    Video-Based Reconstruction of the Trajectories Performed by Skiers

    Authors: Matteo Dunnhofer, Alberto Zurini, Maurizio Dunnhofer, Christian Micheloni

    Abstract: Trajectories are fundamental in different skiing disciplines. Tools enabling the analysis of such curves can enhance the training activity and enrich the broadcasting contents. However, the solutions currently available are based on geo-localized sensors and surface models. In this short paper, we propose a video-based approach to reconstruct the sequence of points traversed by an athlete during i… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

  9. arXiv:2110.13217  [pdf, other

    eess.IV cs.CV cs.LG

    RBSRICNN: Raw Burst Super-Resolution through Iterative Convolutional Neural Network

    Authors: Rao Muhammad Umer, Christian Micheloni

    Abstract: Modern digital cameras and smartphones mostly rely on image signal processing (ISP) pipelines to produce realistic colored RGB images. However, compared to DSLR cameras, low-quality images are usually obtained in many portable mobile devices with compact camera sensors due to their physical limitations. The low-quality images have multiple degradations i.e., sub-pixel shift due to camera motion, m… ▽ More

    Submitted 10 November, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: Fourth Workshop on Machine Learning and the Physical Sciences (NeurIPS 2021)

  10. arXiv:2109.07871  [pdf, other

    cs.CV cs.AI

    Resolution based Feature Distillation for Cross Resolution Person Re-Identification

    Authors: Asad Munir, Chengjin Lyu, Bart Goossens, Wilfried Philips, Christian Micheloni

    Abstract: Person re-identification (re-id) aims to retrieve images of same identities across different camera views. Resolution mismatch occurs due to varying distances between person of interest and cameras, this significantly degrades the performance of re-id in real world scenarios. Most of the existing approaches resolve the re-id task as low resolution problem in which a low resolution query image is s… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: 9 pages

  11. Is First Person Vision Challenging for Object Tracking?

    Authors: Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella, Christian Micheloni

    Abstract: Understanding human-object interactions is fundamental in First Person Vision (FPV). Tracking algorithms which follow the objects manipulated by the camera wearer can provide useful cues to effectively model such interactions. Visual tracking solutions available in the computer vision literature have significantly improved their performance in the last years for a large variety of target objects a… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: IEEE/CVF International Conference on Computer Vision (ICCV) 2021, Visual Object Tracking Challenge VOT2021 workshop. arXiv admin note: text overlap with arXiv:2011.12263

  12. arXiv:2107.03145  [pdf, other

    eess.IV cs.CV cs.LG

    A Deep Residual Star Generative Adversarial Network for multi-domain Image Super-Resolution

    Authors: Rao Muhammad Umer, Asad Munir, Christian Micheloni

    Abstract: Recently, most of state-of-the-art single image super-resolution (SISR) methods have attained impressive performance by using deep convolutional neural networks (DCNNs). The existing SR methods have limited performance due to a fixed degradation settings, i.e. usually a bicubic downscaling of low-resolution (LR) image. However, in real-world settings, the LR degradation process is unknown which ca… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 5 pages, 6th International Conference on Smart and Sustainable Technologies 2021. arXiv admin note: text overlap with arXiv:2009.03693, arXiv:2005.00953

  13. arXiv:2106.03839  [pdf, other

    cs.CV

    NTIRE 2021 Challenge on Burst Super-Resolution: Methods and Results

    Authors: Goutam Bhat, Martin Danelljan, Radu Timofte, Kazutoshi Akita, Wooyeong Cho, Haoqiang Fan, Lanpeng Jia, Daeshik Kim, Bruno Lecouat, Youwei Li, Shuaicheng Liu, Ziluan Liu, Ziwei Luo, Takahiro Maeda, Julien Mairal, Christian Micheloni, Xuan Mo, Takeru Oba, Pavel Ostyakov, Jean Ponce, Sanghyeok Son, Jian Sun, Norimichi Ukita, Rao Muhammad Umer, Youliang Yan , et al. (3 additional authors not shown)

    Abstract: This paper reviews the NTIRE2021 challenge on burst super-resolution. Given a RAW noisy burst as input, the task in the challenge was to generate a clean RGB image with 4 times higher resolution. The challenge contained two tracks; Track 1 evaluating on synthetically generated data, and Track 2 using real-world bursts from mobile camera. In the final testing phase, 6 teams submitted results using… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: NTIRE 2021 Burst Super-Resolution challenge report

  14. Weakly-Supervised Domain Adaptation of Deep Regression Trackers via Reinforced Knowledge Distillation

    Authors: Matteo Dunnhofer, Niki Martinel, Christian Micheloni

    Abstract: Deep regression trackers are among the fastest tracking algorithms available, and therefore suitable for real-time robotic applications. However, their accuracy is inadequate in many domains due to distribution shift and overfitting. In this paper we overcome such limitations by presenting the first methodology for domain adaption of such a class of trackers. To reduce the labeling effort we propo… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: IEEE Robotics and Automation Letters (RA-L)

  15. arXiv:2101.07576  [pdf, other

    cs.CV cs.LG

    Collaboration among Image and Object Level Features for Image Colourisation

    Authors: Rita Pucci, Christian Micheloni, Niki Martinel

    Abstract: Image colourisation is an ill-posed problem, with multiple correct solutions which depend on the context and object instances present in the input datum. Previous approaches attacked the problem either by requiring intense user interactions or by exploiting the ability of convolutional neural networks (CNNs) in learning image level (context) features. However, obtaining human hints is not always f… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

  16. arXiv:2012.02478  [pdf, other

    cs.CV cs.LG

    Is It a Plausible Colour? UCapsNet for Image Colourisation

    Authors: Rita Pucci, Christian Micheloni, Gian Luca Foresti, Niki Martinel

    Abstract: Human beings can imagine the colours of a grayscale image with no particular effort thanks to their ability of semantic feature extraction. Can an autonomous system achieve that? Can it hallucinate plausible and vibrant colours? This is the colourisation problem. Different from existing works relying on convolutional neural network models pre-trained with supervision, we cast such colourisation pr… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  17. arXiv:2011.12263  [pdf, other

    cs.CV

    Is First Person Vision Challenging for Object Tracking?

    Authors: Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella, Christian Micheloni

    Abstract: Understanding human-object interactions is fundamental in First Person Vision (FPV). Tracking algorithms which follow the objects manipulated by the camera wearer can provide useful cues to effectively model such interactions. Despite a few previous attempts to exploit trackers in FPV applications, a methodical analysis of the performance of state-of-the-art visual trackers in this domain is still… ▽ More

    Submitted 24 September, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Extended Abstract accepted by the EPIC workshop at ICCV 2021. The full version of this paper is available at arXiv:2108.13665

  18. arXiv:2009.12072  [pdf, other

    cs.CV

    AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results

    Authors: Pengxu Wei, Hannan Lu, Radu Timofte, Liang Lin, Wangmeng Zuo, Zhihong Pan, Baopu Li, Teng Xi, Yanwen Fan, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding, Tangxin Xie, Liang Cao, Yan Zou, Yi Shen, Jialiang Zhang, Yu Jia, Kaihua Cheng, Chenhuan Wu, Yue Lin, Cen Liu, Yunbo Peng, Xueyi Zou , et al. (51 additional authors not shown)

    Abstract: This paper introduces the real image Super-Resolution (SR) challenge that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ECCV 2020. This challenge involves three tracks to super-resolve an input image for $\times$2, $\times$3 and $\times$4 scaling factors, respectively. The goal is to attract more attention to realistic image degradation for the SR task, wh… ▽ More

    Submitted 25 September, 2020; originally announced September 2020.

    Journal ref: European Conference on Computer Vision Workshops, 2020

  19. arXiv:2009.06943  [pdf, other

    eess.IV cs.CV

    AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Xiaotong Luo, Liang Chen, Jiangtao Zhang, Maitreya Suin , et al. (60 additional authors not shown)

    Abstract: This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor x4 based on a set of prior examples of low and corresponding high resolution images. The goal is to devise a network that reduces one or several aspects such as runtime, parameter co… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

  20. arXiv:2009.04809  [pdf, other

    eess.IV cs.CV

    Deep Iterative Residual Convolutional Network for Single Image Super-Resolution

    Authors: Rao Muhammad Umer, Gian Luca Foresti, Christian Micheloni

    Abstract: Deep convolutional neural networks (CNNs) have recently achieved great success for single image super-resolution (SISR) task due to their powerful feature representation capabilities. The most recent deep learning based SISR methods focus on designing deeper / wider models to learn the non-linear mapping between low-resolution (LR) inputs and high-resolution (HR) outputs. These existing SR methods… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: To be appeared in proceedings of the 25th IEEE International Conference on Pattern Recognition (ICPR). arXiv admin note: text overlap with arXiv:2005.00953, arXiv:2009.03693

  21. arXiv:2009.03693  [pdf, other

    eess.IV cs.CV

    Deep Cyclic Generative Adversarial Residual Convolutional Networks for Real Image Super-Resolution

    Authors: Rao Muhammad Umer, Christian Micheloni

    Abstract: Recent deep learning based single image super-resolution (SISR) methods mostly train their models in a clean data domain where the low-resolution (LR) and the high-resolution (HR) images come from noise-free settings (same domain) due to the bicubic down-sampling assumption. However, such degradation process is not available in real-world settings. We consider a deep cyclic network structure to ma… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: In proceedings of European Conference on Computer Vision (ECCV) Workshops. arXiv admin note: substantial text overlap with arXiv:2005.00953

  22. An Exploration of Target-Conditioned Segmentation Methods for Visual Object Trackers

    Authors: Matteo Dunnhofer, Niki Martinel, Christian Micheloni

    Abstract: Visual object tracking is the problem of predicting a target object's state in a video. Generally, bounding-boxes have been used to represent states, and a surge of effort has been spent by the community to produce efficient causal algorithms capable of locating targets with such representations. As the field is moving towards binary segmentation masks to define objects more precisely, in this pap… ▽ More

    Submitted 13 August, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: European Conference on Computer Vision (ECCV) 2020, Visual Object Tracking Challenge VOT2020 workshop

  23. Tracking-by-Trackers with a Distilled and Reinforced Model

    Authors: Matteo Dunnhofer, Niki Martinel, Christian Micheloni

    Abstract: Visual object tracking was generally tackled by reasoning independently on fast processing algorithms, accurate online adaptation methods, and fusion of trackers. In this paper, we unify such goals by proposing a novel tracking methodology that takes advantage of other visual trackers, offline and online. A compact student model is trained via the marriage of knowledge distillation and reinforceme… ▽ More

    Submitted 30 September, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Asian Conference on Computer Vision (ACCV) 2020

  24. arXiv:2005.01996  [pdf, other

    eess.IV cs.CV

    NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results

    Authors: Andreas Lugmayr, Martin Danelljan, Radu Timofte, Namhyuk Ahn, Dongwoon Bai, Jie Cai, Yun Cao, Junyang Chen, Kaihua Cheng, SeYoung Chun, Wei Deng, Mostafa El-Khamy, Chiu Man Ho, Xiaozhong Ji, Amin Kheradmand, Gwantae Kim, Hanseok Ko, Kanghyu Lee, Jungwon Lee, Hao Li, Ziluan Liu, Zhi-Song Liu, Shuai Liu, Yunhua Lu, Zibo Meng , et al. (21 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on real world super-resolution. It focuses on the participating methods and final results. The challenge addresses the real world setting, where paired true high and low-resolution images are unavailable. For training, only one set of source input images is therefore provided along with a set of unpaired high-quality target images. In Track 1: Image Proc… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

  25. arXiv:2005.00953  [pdf, other

    eess.IV cs.CV

    Deep Generative Adversarial Residual Convolutional Networks for Real-World Super-Resolution

    Authors: Rao Muhammad Umer, Gian Luca Foresti, Christian Micheloni

    Abstract: Most current deep learning based single image super-resolution (SISR) methods focus on designing deeper / wider models to learn the non-linear mapping between low-resolution (LR) inputs and the high-resolution (HR) outputs from a large number of paired (LR/HR) training data. They usually take as assumption that the LR image is a bicubic down-sampled version of the HR image. However, such degradati… ▽ More

    Submitted 2 May, 2020; originally announced May 2020.

  26. arXiv:2004.06154  [pdf, other

    cs.CV cs.RO

    An Efficient UAV-based Artificial Intelligence Framework for Real-Time Visual Tasks

    Authors: Enkhtogtokh Togootogtokh, Christian Micheloni, Gian Luca Foresti, Niki Martinel

    Abstract: Modern Unmanned Aerial Vehicles equipped with state of the art artificial intelligence (AI) technologies are opening to a wide plethora of novel and interesting applications. While this field received a strong impact from the recent AI breakthroughs, most of the provided solutions either entirely rely on commercial software or provide a weak integration interface which denies the development of ad… ▽ More

    Submitted 13 April, 2020; originally announced April 2020.

  27. arXiv:1910.04856  [pdf, other

    cs.CV cs.LG stat.ML

    Video-Based Convolutional Attention for Person Re-Identification

    Authors: Marco Zamprogno, Marco Passon, Niki Martinel, Giuseppe Serra, Giuseppe Lancioni, Christian Micheloni, Carlo Tasso, Gian Luca Foresti

    Abstract: In this paper we consider the problem of video-based person re-identification, which is the task of associating videos of the same person captured by different and non-overlapping cameras. We propose a Siamese framework in which video frames of the person to re-identify and of the candidate one are processed by two identical networks which produce a similarity score. We introduce an attention mech… ▽ More

    Submitted 26 September, 2019; originally announced October 2019.

    Comments: 11 pages, 2 figures. Accepted by ICIAP2019, 20th International Conference on IMAGE ANALYSIS AND PROCESSING, Trento, Italy, 9-13 September, 2019

  28. Visual Tracking by means of Deep Reinforcement Learning and an Expert Demonstrator

    Authors: Matteo Dunnhofer, Niki Martinel, Gian Luca Foresti, Christian Micheloni

    Abstract: In the last decade many different algorithms have been proposed to track a generic object in videos. Their execution on recent large-scale video datasets can produce a great amount of various tracking behaviours. New trends in Reinforcement Learning showed that demonstrations of an expert agent can be efficiently used to speed-up the process of policy learning. Taking inspiration from such works a… ▽ More

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: in 2019 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) - VOT2019 Challenge Workshop

  29. Deep Super-Resolution Network for Single Image Super-Resolution with Realistic Degradations

    Authors: Rao Muhammad Umer, Gian Luca Foresti, Christian Micheloni

    Abstract: Single Image Super-Resolution (SISR) aims to generate a high-resolution (HR) image of a given low-resolution (LR) image. The most of existing convolutional neural network (CNN) based SISR methods usually take an assumption that a LR image is only bicubicly down-sampled version of an HR image. However, the true degradation (i.e. the LR image is a bicubicly downsampled, blurred and noisy version of… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: 7 pages

    Journal ref: 13th International Conference on Distributed Smart Cameras (ICDSC 2019)

  30. arXiv:1612.06543  [pdf, other

    cs.CV

    Wide-Slice Residual Networks for Food Recognition

    Authors: Niki Martinel, Gian Luca Foresti, Christian Micheloni

    Abstract: Food diary applications represent a tantalizing market. Such applications, based on image food recognition, opened to new challenges for computer vision and pattern recognition algorithms. Recent works in the field are focusing either on hand-crafted representations or on learning these by exploiting deep neural networks. Despite the success of such a last family of works, these generally exploit… ▽ More

    Submitted 20 December, 2016; originally announced December 2016.

  31. arXiv:1607.07216  [pdf, other

    cs.CV

    Temporal Model Adaptation for Person Re-Identification

    Authors: Niki Martinel, Abir Das, Christian Micheloni, Amit K. Roy-Chowdhury

    Abstract: Person re-identification is an open and challenging problem in computer vision. Majority of the efforts have been spent either to design the best feature representation or to learn the optimal matching metric. Most approaches have neglected the problem of adapting the selected features or the learned model over time. To address such a problem, we propose a temporal model adaptation scheme with hum… ▽ More

    Submitted 25 July, 2016; originally announced July 2016.