Skip to main content

Showing 1–14 of 14 results for author: Katsavounidis, I

  1. arXiv:2404.16484  [pdf, other

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu , et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  2. arXiv:2404.13484  [pdf, other

    eess.IV cs.CV

    Joint Quality Assessment and Example-Guided Image Processing by Disentangling Picture Appearance from Content

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Hassene Tmar, Alan C. Bovik

    Abstract: The deep learning revolution has strongly impacted low-level image processing tasks such as style/domain transfer, enhancement/restoration, and visual quality assessments. Despite often being treated separately, the aforementioned tasks share a common theme of understanding, editing, or enhancing the appearance of input images without modifying the underlying content. We leverage this observation… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  3. arXiv:2404.13452  [pdf, other

    eess.IV cs.CV

    Cut-FUNQUE: An Objective Quality Model for Compressed Tone-Mapped High Dynamic Range Videos

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Hassene Tmar, Alan C. Bovik

    Abstract: High Dynamic Range (HDR) videos have enjoyed a surge in popularity in recent years due to their ability to represent a wider range of contrast and color than Standard Dynamic Range (SDR) videos. Although HDR video capture has seen increasing popularity because of recent flagship mobile phones such as Apple iPhones, Google Pixels, and Samsung Galaxy phones, a broad swath of consumers still utilize… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  4. arXiv:2401.16067  [pdf, other

    eess.IV cs.MM

    Encoding Time and Energy Model for SVT-AV1 based on Video Complexity

    Authors: Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, Christian Herglotz, André Kaup

    Abstract: The share of online video traffic in global carbon dioxide emissions is growing steadily. To comply with the demand for video media, dedicated compression techniques are continuously optimized, but at the expense of increasingly higher computational demands and thus rising energy consumption at the video encoder side. In order to find the best trade-off between compression and energy consumption,… ▽ More

    Submitted 30 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 5 pages, 1 figure, accepted for IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2024

  5. arXiv:2312.08524  [pdf, other

    eess.IV cs.CV

    A FUNQUE Approach to the Quality Assessment of Compressed HDR Videos

    Authors: Abhinau K. Venkataramanan, Cosmin Stejerean, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: Recent years have seen steady growth in the popularity and availability of High Dynamic Range (HDR) content, particularly videos, streamed over the internet. As a result, assessing the subjective quality of HDR videos, which are generally subjected to compression, is of increasing importance. In particular, we target the task of full-reference quality assessment of compressed HDR videos. The state… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  6. Study of Subjective and Objective Quality Assessment of Mobile Cloud Gaming Videos

    Authors: Avinab Saha, Yu-Chih Chen, Chase Davis, Bo Qiu, Xiaoming Wang, Rahul Gowda, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: We present the outcomes of a recent large-scale subjective study of Mobile Cloud Gaming Video Quality Assessment (MCG-VQA) on a diverse set of gaming videos. Rapid advancements in cloud services, faster video encoding technologies, and increased access to high-speed, low-latency wireless internet have all contributed to the exponential growth of the Mobile Cloud Gaming industry. Consequently, the… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Transactions on Image Processing, 2023. The database will be publicly available by 1st week of July 2023

  7. arXiv:2305.02422  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    GAMIVAL: Video Quality Prediction on Mobile Cloud Gaming Content

    Authors: Yu-Chih Chen, Avinab Saha, Chase Davis, Bo Qiu, Xiaoming Wang, Rahul Gowda, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: The mobile cloud gaming industry has been rapidly growing over the last decade. When streaming gaming videos are transmitted to customers' client devices from cloud servers, algorithms that can monitor distorted video quality without having any reference video available are desirable tools. However, creating No-Reference Video Quality Assessment (NR VQA) models that can accurately predict the qual… ▽ More

    Submitted 29 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE SPL 2023. The implementation of GAMIVAL has been made available online: https://github.com/lskdream/GAMIVAL

    MSC Class: 68U10

    Journal ref: IEEE Signal Processing Letters, vol. 30, pp. 324-328, 2023

  8. arXiv:2209.12139  [pdf, other

    cs.CV

    Lightweight Image Codec via Multi-Grid Multi-Block-Size Vector Quantization (MGBVQ)

    Authors: Yifan Wang, Zhanxuan Mei, Ioannis Katsavounidis, C. -C. Jay Kuo

    Abstract: A multi-grid multi-block-size vector quantization (MGBVQ) method is proposed for image coding in this work. The fundamental idea of image coding is to remove correlations among pixels before quantization and entropy coding, e.g., the discrete cosine transform (DCT) and intra predictions, adopted by modern image coding standards. We present a new method to remove pixel correlations. First, by decom… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

    Comments: GIC-python-v2

  9. arXiv:2102.00502  [pdf, other

    cs.MM eess.IV

    A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

    Authors: Yifan Wang, Zhanxuan Mei, Chia-Yang Tsai, Ioannis Katsavounidis, C. -C. Jay Kuo

    Abstract: The design of the optimal inverse discrete cosine transform (IDCT) to compensate the quantization error is proposed for effective lossy image compression in this work. The forward and inverse DCTs are designed in pair in current image/video coding standards without taking the quantization effect into account. Yet, the distribution of quantized DCT coefficients deviate from that of original DCT coe… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

    Comments: conference

  10. arXiv:2101.06354  [pdf, other

    eess.IV cs.CV cs.MM

    A Hitchhiker's Guide to Structural Similarity

    Authors: Abhinau K. Venkataramanan, Chengyang Wu, Alan C. Bovik, Ioannis Katsavounidis, Zafar Shahid

    Abstract: The Structural Similarity (SSIM) Index is a very widely used image/video quality model that continues to play an important role in the perceptual evaluation of compression algorithms, encoding recipes and numerous other image/video processing algorithms. Several public implementations of the SSIM and Multiscale-SSIM (MS-SSIM) algorithms have been developed, which differ in efficiency and performan… ▽ More

    Submitted 30 January, 2021; v1 submitted 15 January, 2021; originally announced January 2021.

    Comments: Submitted final version to IEEE Access on January 30, 2021

  11. arXiv:2004.02067  [pdf, other

    cs.MM eess.IV

    A Simple Model for Subject Behavior in Subjective Experiments

    Authors: Zhi Li, Christos G. Bampis, Lukáš Krasula, Lucjan Janowski, Ioannis Katsavounidis

    Abstract: In a subjective experiment to evaluate the perceptual audiovisual quality of multimedia and television services, raw opinion scores collected from test subjects are often noisy and unreliable. To produce the final mean opinion scores (MOS), recommendations such as ITU-R BT.500, ITU-T P.910 and ITU-T P.913 standardize post-test screening procedures to clean up the raw opinion scores, using techniqu… ▽ More

    Submitted 6 May, 2021; v1 submitted 4 April, 2020; originally announced April 2020.

    Comments: 14 pages, updated version of the original paper published in Human Vision and Electronic Imaging (HVEI) 2020

  12. arXiv:1807.10894  [pdf, other

    cs.MM

    A user model for JND-based video quality assessment: theory and applications

    Authors: Haiqiang Wang, Ioannis Katsavounidis, Xinfeng Zhang, Chao Yang, C. -C. Jay Kuo

    Abstract: The video quality assessment (VQA) technology has attracted a lot of attention in recent years due to an increasing demand of video streaming services. Existing VQA methods are designed to predict video quality in terms of the mean opinion score (MOS) calibrated by humans in subjective experiments. However, they cannot predict the satisfied user ratio (SUR) of an aggregated viewer group. Furthermo… ▽ More

    Submitted 28 July, 2018; originally announced July 2018.

    Comments: To appear at SPIE 2018

  13. arXiv:1710.11090  [pdf, other

    cs.MM

    Prediction of Satisfied User Ratio for Compressed Video

    Authors: Haiqiang Wang, Ioannis Katsavounidis, Qin Huang, Xin Zhou, C. -C. Jay Kuo

    Abstract: A large-scale video quality dataset called the VideoSet has been constructed recently to measure human subjective experience of H.264 coded video in terms of the just-noticeable-difference (JND). It measures the first three JND points of 5-second video of resolution 1080p, 720p, 540p and 360p. Based on the VideoSet, we propose a method to predict the satisfied-user-ratio (SUR) curves using a machi… ▽ More

    Submitted 30 October, 2017; originally announced October 2017.

  14. arXiv:1701.01500  [pdf, other

    cs.MM

    VideoSet: A Large-Scale Compressed Video Quality Dataset Based on JND Measurement

    Authors: Haiqiang Wang, Ioannis Katsavounidis, Jiantong Zhou, Jeonghoon Park, Shawmin Lei, Xin Zhou, Man-On Pun, Xin Jin, Ronggang Wang, Xu Wang, Yun Zhang, Jiwu Huang, Sam Kwong, C. -C. Jay Kuo

    Abstract: A new methodology to measure coded image/video quality using the just-noticeable-difference (JND) idea was proposed. Several small JND-based image/video quality datasets were released by the Media Communications Lab at the University of Southern California. In this work, we present an effort to build a large-scale JND-based coded video quality dataset. The dataset consists of 220 5-second sequence… ▽ More

    Submitted 14 January, 2017; v1 submitted 5 January, 2017; originally announced January 2017.