Skip to main content

Showing 1–19 of 19 results for author: Shah, N A

  1. arXiv:2307.11081  [pdf, other

    cs.CV cs.LG

    GLSFormer: Gated - Long, Short Sequence Transformer for Step Recognition in Surgical Videos

    Authors: Nisarg A. Shah, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel

    Abstract: Automated surgical step recognition is an important task that can significantly improve patient safety and decision-making during surgeries. Existing state-of-the-art methods for surgical step recognition either rely on separate, multi-stage modeling of spatial and temporal information or operate on short-range temporal resolution when learned jointly. However, the benefits of joint modeling of sp… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted to MICCAI 2023 (Early Accept)

  2. arXiv:2203.10363  [pdf, other

    cs.CV cs.GR eess.IV

    Towards Device Efficient Conditional Image Generation

    Authors: Nisarg A. Shah, Gaurav Bharaj

    Abstract: We present a novel algorithm to reduce tensor compute required by a conditional image generation autoencoder without sacrificing quality of photo-realistic image generation. Our method is device agnostic, and can optimize an autoencoder for a given CPU-only, GPU compute device(s) in about normal time it takes to train an autoencoder on a generic workstation. We achieve this via a two-stage novel s… ▽ More

    Submitted 13 October, 2022; v1 submitted 19 March, 2022; originally announced March 2022.

    Comments: British Machine Vision Conference 2022

  3. arXiv:2203.00845  [pdf, other

    eess.IV cs.AI cs.CV

    Can No-reference features help in Full-reference image quality estimation?

    Authors: Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah

    Abstract: Development of perceptual image quality assessment (IQA) metrics has been of significant interest to computer vision community. The aim of these metrics is to model quality of an image as perceived by humans. Recent works in Full-reference IQA research perform pixelwise comparison between deep features corresponding to query and reference images for quality prediction. However, pixelwise feature c… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: Code to be updated on: https://github.com/saikatdutta/nr-in-friqa

  4. ADAM Challenge: Detecting Age-related Macular Degeneration from Fundus Images

    Authors: Huihui Fang, Fei Li, Huazhu Fu, Xu Sun, Xingxing Cao, Fengbin Lin, Jaemin Son, Sunho Kim, Gwenole Quellec, Sarah Matta, Sharath M Shankaranarayana, Yi-Ting Chen, Chuen-heng Wang, Nisarg A. Shah, Chia-Yen Lee, Chih-Chung Hsu, Hai Xie, Baiying Lei, Ujjwal Baid, Shubham Innani, Kang Dang, Wenxiu Shi, Ravi Kamble, Nitin Singhal, Ching-Wei Wang , et al. (6 additional authors not shown)

    Abstract: Age-related macular degeneration (AMD) is the leading cause of visual impairment among elderly in the world. Early detection of AMD is of great importance, as the vision loss caused by this disease is irreversible and permanent. Color fundus photography is the most cost-effective imaging modality to screen for retinal disorders. Cutting edge deep learning based algorithms have been recently develo… ▽ More

    Submitted 6 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: 31 pages, 17 figures

  5. arXiv:2201.11506  [pdf, other

    cs.CV

    Anomaly Detection in Retinal Images using Multi-Scale Deep Feature Sparse Coding

    Authors: Sourya Dipta Das, Saikat Dutta, Nisarg A. Shah, Dwarikanath Mahapatra, Zongyuan Ge

    Abstract: Convolutional Neural Network models have successfully detected retinal illness from optical coherence tomography (OCT) and fundus images. These CNN models frequently rely on vast amounts of labeled data for training, difficult to obtain, especially for rare diseases. Furthermore, a deep learning system trained on a data set with only one or a few diseases cannot detect other diseases, limiting the… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted to ISBI 2022.©IEEE

  6. arXiv:2112.10074  [pdf, other

    eess.IV cs.CV cs.LG

    QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking Results

    Authors: Raghav Mehta, Angelos Filos, Ujjwal Baid, Chiharu Sako, Richard McKinley, Michael Rebsamen, Katrin Datwyler, Raphael Meier, Piotr Radojewski, Gowtham Krishnan Murugesan, Sahil Nalawade, Chandan Ganesh, Ben Wagner, Fang F. Yu, Baowei Fei, Ananth J. Madhuranthakam, Joseph A. Maldjian, Laura Daza, Catalina Gomez, Pablo Arbelaez, Chengliang Dai, Shuo Wang, Hadrien Reynaud, Yuan-han Mo, Elsa Angelini , et al. (67 additional authors not shown)

    Abstract: Deep learning (DL) models have provided state-of-the-art performance in various medical imaging benchmarking challenges, including the Brain Tumor Segmentation (BraTS) challenges. However, the task of focal pathology multi-compartment segmentation (e.g., tumor and lesion sub-regions) is particularly challenging, and potential errors hinder translating DL models into clinical workflows. Quantifying… ▽ More

    Submitted 23 August, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA): https://www.melba-journal.org/papers/2022:026.html

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

  7. arXiv:2107.06125  [pdf, other

    cs.CV

    MSR-Net: Multi-Scale Relighting Network for One-to-One Relighting

    Authors: Sourya Dipta Das, Nisarg A. Shah, Saikat Dutta

    Abstract: Deep image relighting allows photo enhancement by illumination-specific retouching without human effort and so it is getting much interest lately. Most of the existing popular methods available for relighting are run-time intensive and memory inefficient. Keeping these issues in mind, we propose the use of Stacked Deep Multi-Scale Hierarchical Network, which aggregates features from each image at… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: Workshop on Differentiable Vision, Graphics, and Physics in Machine Learning at NeurIPS 2020. arXiv admin note: text overlap with arXiv:2102.09242

  8. arXiv:2105.08819  [pdf, other

    eess.IV cs.CV cs.LG

    Fast and Accurate Quantized Camera Scene Detection on Smartphones, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Grigory Malivenko, Radu Timofte, Sheng Chen, Xin Xia, Zhaoyan Liu, Yuwei Zhang, Feng Zhu, Jiashi Li, Xuefeng Xiao, Yuan Tian, Xinglong Wu, Christos Kyrkou, Yixin Chen, Zexin Zhang, Yunbo Peng, Yue Lin, Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Himanshu Kumar, Chao Ge, Pei-Lin Wu, Jin-Hua Du, Andrew Batutin , et al. (6 additional authors not shown)

    Abstract: Camera scene detection is among the most popular computer vision problem on smartphones. While many custom solutions were developed for this task by phone vendors, none of the designed models were available publicly up until now. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop quantized deep learning-based camera scene classification solutions th… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/. arXiv admin note: substantial text overlap with arXiv:2105.08630; text overlap with arXiv:2105.07825, arXiv:2105.07809, arXiv:2105.08629

  9. arXiv:2105.07174  [pdf, other

    cs.CV cs.AI

    Stacked Deep Multi-Scale Hierarchical Network for Fast Bokeh Effect Rendering from a Single Image

    Authors: Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Anil Kumar Tiwari

    Abstract: The Bokeh Effect is one of the most desirable effects in photography for rendering artistic and aesthetic photos. Usually, it requires a DSLR camera with different aperture and shutter settings and certain photography skills to generate this effect. In smartphones, computational methods and additional sensors are used to overcome the physical lens and sensor limitations to achieve such effect. Mos… ▽ More

    Submitted 15 May, 2021; originally announced May 2021.

    Comments: Accepted to MAI workshop, CVPR 2021. Code and models: https://github.com/saikatdutta/Stacked_DMSHN_bokeh

  10. arXiv:2104.05778  [pdf, other

    eess.IV cs.CV

    Efficient Space-time Video Super Resolution using Low-Resolution Flow and Mask Upsampling

    Authors: Saikat Dutta, Nisarg A. Shah, Anurag Mittal

    Abstract: This paper explores an efficient solution for Space-time Super-Resolution, aiming to generate High-resolution Slow-motion videos from Low Resolution and Low Frame rate videos. A simplistic solution is the sequential running of Video Super Resolution and Video Frame interpolation models. However, this type of solutions are memory inefficient, have high inference time, and could not make the proper… ▽ More

    Submitted 8 June, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted at NTIRE Workshop, CVPR 2021. Code and models: https://github.com/saikatdutta/FMU_STSR

  11. arXiv:2103.09289  [pdf, other

    eess.IV cs.CV cs.LG

    Colorectal Cancer Segmentation using Atrous Convolution and Residual Enhanced UNet

    Authors: Nisarg A. Shah, Divij Gupta, Romil Lodaya, Ujjwal Baid, Sanjay Talbar

    Abstract: Colorectal cancer is a leading cause of death worldwide. However, early diagnosis dramatically increases the chances of survival, for which it is crucial to identify the tumor in the body. Since its imaging uses high-resolution techniques, annotating the tumor is time-consuming and requires particular expertise. Lately, methods built upon Convolutional Neural Networks(CNNs) have proven to be at pa… ▽ More

    Submitted 16 March, 2021; originally announced March 2021.

    Comments: 5th IAPR International Conference on Computer Vision and Image Processing, 12 pages

  12. arXiv:2102.09242  [pdf, other

    cs.CV

    DSRN: an Efficient Deep Network for Image Relighting

    Authors: Sourya Dipta Das, Nisarg A. Shah, Saikat Dutta, Himanshu Kumar

    Abstract: Custom and natural lighting conditions can be emulated in images of the scene during post-editing. Extraordinary capabilities of the deep learning framework can be utilized for such purpose. Deep image relighting allows automatic photo enhancement by illumination-specific retouching. Most of the state-of-the-art methods for relighting are run-time intensive and memory inefficient. In this paper, w… ▽ More

    Submitted 16 June, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted at ICIP 2021. $©$ IEEE

  13. arXiv:2011.04988  [pdf, other

    eess.IV cs.CV

    AIM 2020 Challenge on Rendering Realistic Bokeh

    Authors: Andrey Ignatov, Radu Timofte, Ming Qian, Congyu Qiao, Jiamin Lin, Zhenyu Guo, Chenghua Li, Cong Leng, Jian Cheng, Juewen Peng, Xianrui Luo, Ke Xian, Zijin Wu, Zhiguo Cao, Densen Puthussery, Jiji C V, Hrishikesh P S, Melvin Kuriakose, Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Kuldeep Purohit, Praveen Kandula, Maitreya Suin, A. N. Rajagopalan , et al. (10 additional authors not shown)

    Abstract: This paper reviews the second AIM realistic bokeh effect rendering challenge and provides the description of the proposed solutions and results. The participating teams were solving a real-world bokeh simulation problem, where the goal was to learn a realistic shallow focus technique using a large-scale EBB! bokeh dataset consisting of 5K shallow / wide depth-of-field image pairs captured using th… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: Published in ECCV 2020 Workshop (Advances in Image Manipulation), https://data.vision.ee.ethz.ch/cvl/aim20/

  14. arXiv:2009.12798  [pdf, other

    cs.CV eess.IV

    AIM 2020: Scene Relighting and Illumination Estimation Challenge

    Authors: Majed El Helou, Ruofan Zhou, Sabine Süsstrunk, Radu Timofte, Mahmoud Afifi, Michael S. Brown, Kele Xu, Hengxing Cai, Yuzhong Liu, Li-Wen Wang, Zhi-Song Liu, Chu-Tak Li, Sourya Dipta Das, Nisarg A. Shah, Akashdeep Jassal, Tongtong Zhao, Shanshan Zhao, Sabari Nathan, M. Parisa Beham, R. Suganya, Qing Wang, Zhongyun Hu, Xin Huang, Yaning Li, Maitreya Suin , et al. (12 additional authors not shown)

    Abstract: We review the AIM 2020 challenge on virtual image relighting and illumination estimation. This paper presents the novel VIDIT dataset used in the challenge and the different proposed solutions and final evaluation results over the 3 challenge tracks. The first track considered one-to-one relighting; the objective was to relight an input photo of a scene with a different color temperature and illum… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

    Comments: ECCVW 2020. Data and more information on https://github.com/majedelhelou/VIDIT

  15. arXiv:2008.07742  [pdf, other

    eess.IV cs.CV

    UDC 2020 Challenge on Image Restoration of Under-Display Camera: Methods and Results

    Authors: Yuqian Zhou, Michael Kwan, Kyle Tolentino, Neil Emerton, Sehoon Lim, Tim Large, Lijiang Fu, Zhihong Pan, Baopu Li, Qirui Yang, Yihao Liu, Jigang Tang, Tao Ku, Shibin Ma, Bingnan Hu, Jiarong Wang, Densen Puthussery, Hrishikesh P S, Melvin Kuriakose, Jiji C V, Varun Sundar, Sumanth Hegde, Divya Kothandaraman, Kaushik Mitra, Akashdeep Jassal , et al. (20 additional authors not shown)

    Abstract: This paper is the report of the first Under-Display Camera (UDC) image restoration challenge in conjunction with the RLQ workshop at ECCV 2020. The challenge is based on a newly-collected database of Under-Display Camera. The challenge tracks correspond to two types of display: a 4k Transparent OLED (T-OLED) and a phone Pentile OLED (P-OLED). Along with about 150 teams registered the challenge, ei… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: 15 pages

  16. arXiv:2005.04117  [pdf, other

    cs.CV eess.IV

    NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results

    Authors: Abdelrahman Abdelhamed, Mahmoud Afifi, Radu Timofte, Michael S. Brown, Yue Cao, Zhilu Zhang, Wangmeng Zuo, Xiaoling Zhang, Jiye Liu, Wendong Chen, Changyuan Wen, Meng Liu, Shuailin Lv, Yunchao Zhang, Zhihong Pan, Baopu Li, Teng Xi, Yanwen Fan, Xiyu Yu, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding, Songhyun Yu, Bumjun Park , et al. (65 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on real image denoising with focus on the newly introduced dataset, the proposed methods and their results. The challenge is a new version of the previous NTIRE 2019 challenge on real image denoising that was based on the SIDD benchmark. This challenge is based on a newly collected validation and testing image datasets, and hence, named SIDD+. This chall… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

  17. arXiv:1807.10568  [pdf

    cs.CV

    Barqi Breed Sheep Weight Estimation based on Neural Network with Regression

    Authors: Chintan Bhatt, Aboul-ella Hassanien, Nirav Alpesh Shah, Jaydeep Thik

    Abstract: Computer vision is a very powerful method for understanding the contents from the images. We tried to utilize this powerful technology to make the difficult task of estimating sheep weights quick and accurate. It has enabled us to minimize the human involvement in measuring weight of the sheep. We are using a novel approach for segmentation and neural network based regression model for achieving b… ▽ More

    Submitted 9 July, 2018; originally announced July 2018.

  18. arXiv:1112.5608  [pdf

    cs.CR

    Detecting Threat E-mails using Bayesian Approach

    Authors: M. Tariq Banday, Jameel A. Qadri, Tariq. R. Jan, Nisar. A. Shah

    Abstract: Fraud and terrorism have a close connect in terms of the processes that enables and promote them. In the era of Internet, its various services that include Web, e-mail, social networks, blogs, instant messaging, chats, etc. are used in terrorism not only for communication but also for i) creation of ideology, ii) resource gathering, iii) recruitment, indoctrination and training, iv) creation of te… ▽ More

    Submitted 23 December, 2011; originally announced December 2011.

    Comments: 10 Pages

    ACM Class: K.6.5; D.4.6; K.4.2

    Journal ref: Banday, M.T., Qadri, J.A., Jan, T.R. and Shah, N.A. (2009). "Detecting Threat E-mails using Bayesian Approach," International Journal of Secure Digital Information Age, ISSN: 0975-1823, 1(2), pp. 103-113

  19. arXiv:1112.5605  [pdf

    cs.CR cs.CY

    A Study of CAPTCHAs for Securing Web Services

    Authors: M. Tariq Banday, N. A. Shah

    Abstract: Atomizing various Web activities by replacing human to human interactions on the Internet has been made indispensable due to its enormous growth. However, bots also known as Web-bots which have a malicious intend and pretending to be humans pose a severe threat to various services on the Internet that implicitly assume a human interaction. Accordingly, Web service providers before allowing access… ▽ More

    Submitted 23 December, 2011; originally announced December 2011.

    Comments: 9 Pages

    ACM Class: K.6.5; D.4.6; K.4.2

    Journal ref: Banday, M.T., Shah, N.A. (2009). "A Study of CAPTCHAs for Securing Web Services," IJSDIA International Journal of Secure Digital Information Age, ISSN: 0975-1823, 1(2), pp. 66-74, available online at: http://ijsdia.org/main/?page_id=6