Skip to main content

Showing 1–50 of 63 results for author: Sethi, A

  1. arXiv:2407.11652  [pdf, other

    cs.CV cs.AI cs.LG

    CCVA-FL: Cross-Client Variations Adaptive Federated Learning for Medical Imaging

    Authors: Sunny Gupta, Amit Sethi

    Abstract: Federated Learning (FL) offers a privacy-preserving approach to train models on decentralized data. Its potential in healthcare is significant, but challenges arise due to cross-client variations in medical image data, exacerbated by limited annotations. This paper introduces Cross-Client Variations Adaptive Federated Learning (CCVA-FL) to address these issues. CCVA-FL aims to minimize cross-clien… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 10 pages, 6 figures

    ACM Class: I.2.10; I.4.0; I.4.1; I.4.2; I.4.6; I.4.7; I.4.8; I.4.9; I.4.10; I.2.10; I.5.1; I.5.2; I.5.4; J.2

  2. arXiv:2406.05612  [pdf, other

    cs.CV cs.AI cs.LG

    Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision

    Authors: Pranav Jeevan, Amit Sethi

    Abstract: In contemporary computer vision applications, particularly image classification, architectural backbones pre-trained on large datasets like ImageNet are commonly employed as feature extractors. Despite the widespread use of these pre-trained convolutional neural networks (CNNs), there remains a gap in understanding the performance of various resource-efficient backbones across diverse domains and… ▽ More

    Submitted 29 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: 12 pages, 2 figures

    ACM Class: I.2.10; I.4.0; I.4.1; I.4.2; I.4.6; I.4.7; I.4.8; I.4.9; I.4.10; I.2.10; I.5.1; I.5.2; I.5.4; J.2

  3. arXiv:2405.13666  [pdf, ps, other

    cs.LG

    Generalization Bounds for Dependent Data using Online-to-Batch Conversion

    Authors: Sagnik Chatterjee, Manuj Mukherjee, Alhad Sethi

    Abstract: In this work, we give generalization bounds of statistical learning algorithms trained on samples drawn from a dependent data source, both in expectation and with high probability, using the Online-to-Batch conversion paradigm. We show that the generalization error of statistical learners in the dependent data setting is equivalent to the generalization error of statistical learners in the i.i.d.… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2403.15089  [pdf, other

    cs.CV

    IFSENet : Harnessing Sparse Iterations for Interactive Few-shot Segmentation Excellence

    Authors: Shreyas Chandgothia, Ardhendu Sekhar, Amit Sethi

    Abstract: Training a computer vision system to segment a novel class typically requires collecting and painstakingly annotating lots of images with objects from that class. Few-shot segmentation techniques reduce the required number of images to learn to segment a new class, but careful annotations of object boundaries are still required. On the other hand, interactive segmentation techniques only focus on… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  5. arXiv:2403.01927  [pdf, other

    q-bio.GN cs.CV q-bio.QM q-bio.TO

    Advancing Gene Selection in Oncology: A Fusion of Deep Learning and Sparsity for Precision Gene Selection

    Authors: Akhila Krishna, Ravi Kant Gupta, Pranav Jeevan, Amit Sethi

    Abstract: Gene selection plays a pivotal role in oncology research for improving outcome prediction accuracy and facilitating cost-effective genomic profiling for cancer patients. This paper introduces two gene selection strategies for deep learning-based survival prediction models. The first strategy uses a sparsity-inducing method while the second one uses importance based gene selection for identifying r… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  6. arXiv:2402.11995  [pdf, other

    cs.LG

    Network Inversion of Binarised Neural Nets

    Authors: Pirzada Suhail, Supratik Chakraborty, Amit Sethi

    Abstract: While the deployment of neural networks, yielding impressive results, becomes more prevalent in various applications, their interpretability and understanding remain a critical challenge. Network inversion, a technique that aims to reconstruct the input space from the model's learned internal representations, plays a pivotal role in unraveling the black-box nature of input to output mappings in ne… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  7. arXiv:2312.12450  [pdf, other

    cs.SE cs.AI cs.LG cs.PL

    Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions

    Authors: Federico Cassano, Luisa Li, Akul Sethi, Noah Shinn, Abby Brennan-Jones, Jacob Ginesin, Edward Berman, George Chakhnashvili, Anton Lozhkov, Carolyn Jane Anderson, Arjun Guha

    Abstract: A significant amount of research is focused on developing and evaluating large language models for a variety of code synthesis tasks. These include synthesizing code from natural language, synthesizing tests from code, and synthesizing explanations of code. In contrast, the behavior of instructional code editing with LLMs is understudied. These are tasks in which the model is provided a block of c… ▽ More

    Submitted 19 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  8. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  9. arXiv:2311.18281  [pdf, other

    eess.IV cs.CV

    Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications

    Authors: Sahar Almahfouz Nasser, Shashwat Pathak, Keshav Singhal, Mohit Meena, Nihar Gupte, Ananya Chinmaya, Prateek Garg, Amit Sethi

    Abstract: Graph neural networks (GNNs) present a promising alternative to CNNs and transformers in certain image processing applications due to their parameter-efficiency in modeling spatial relationships. Currently, a major area of research involves the converting non-graph input data for GNN-based models, notably in scenarios where the data originates from images. One approach involves converting images i… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  10. arXiv:2311.14095  [pdf, other

    cs.CV

    Video Anomaly Detection using GAN

    Authors: Anikeit Sethi, Krishanu Saini, Sai Mounika Mididoddi

    Abstract: Accounting for the increased concern for public safety, automatic abnormal event detection and recognition in a surveillance scene is crucial. It is a current open study subject because of its intricacy and utility. The identification of aberrant events automatically, it's a difficult undertaking because everyone's idea of abnormality is different. A typical occurrence in one circumstance could be… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  11. arXiv:2310.03346  [pdf, other

    cs.CV

    Combining Datasets with Different Label Sets for Improved Nucleus Segmentation and Classification

    Authors: Amruta Parulekar, Utkarsh Kanwat, Ravi Kant Gupta, Medha Chippa, Thomas Jacob, Tripti Bameta, Swapnil Rane, Amit Sethi

    Abstract: Segmentation and classification of cell nuclei in histopathology images using deep neural networks (DNNs) can save pathologists' time for diagnosing various diseases, including cancers, by automating cell counting and morphometric assessments. It is now well-known that the accuracy of DNNs increases with the sizes of annotated datasets available for training. Although multiple datasets of histopat… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  12. arXiv:2309.17172  [pdf, other

    cs.CV

    Domain-Adaptive Learning: Unsupervised Adaptation for Histology Images with Improved Loss Function Combination

    Authors: Ravi Kant Gupta, Shounak Das, Amit Sethi

    Abstract: This paper presents a novel approach for unsupervised domain adaptation (UDA) targeting H&E stained histology images. Existing adversarial domain adaptation methods may not effectively align different domains of multimodal distributions associated with classification problems. The objective is to enhance domain alignment and reduce domain shifts between these domains by leveraging their unique cha… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  13. arXiv:2308.05449  [pdf, other

    eess.IV cs.CV

    Transforming Breast Cancer Diagnosis: Towards Real-Time Ultrasound to Mammogram Conversion for Cost-Effective Diagnosis

    Authors: Sahar Almahfouz Nasser, Ashutosh Sharma, Anmol Saraf, Amruta Mahendra Parulekar, Purvi Haria, Amit Sethi

    Abstract: Ultrasound (US) imaging is better suited for intraoperative settings because it is real-time and more portable than other imaging techniques, such as mammography. However, US images are characterized by lower spatial resolution noise-like artifacts. This research aims to address these limitations by providing surgeons with mammogram-like image quality in real-time from noisy US images. Unlike prev… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  14. arXiv:2307.10698  [pdf, other

    cs.CV

    Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data

    Authors: Sahar Almahfouz Nasser, Nihar Gupte, Amit Sethi

    Abstract: Retinal image matching plays a crucial role in monitoring disease progression and treatment response. However, datasets with matched keypoints between temporally separated pairs of images are not available in abundance to train transformer-based model. We propose a novel approach based on reverse knowledge distillation to train large models with limited data while preventing overfitting. Firstly,… ▽ More

    Submitted 21 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  15. arXiv:2307.08132  [pdf, other

    cs.CV cs.AI cs.LG

    Heterogeneous graphs model spatial relationships between biological entities for breast cancer diagnosis

    Authors: Akhila Krishna K, Ravi Kant Gupta, Nikhil Cherian Kurian, Pranav Jeevan, Amit Sethi

    Abstract: The heterogeneity of breast cancer presents considerable challenges for its early detection, prognosis, and treatment selection. Convolutional neural networks often neglect the spatial relationships within histopathological images, which can limit their accuracy. Graph neural networks (GNNs) offer a promising solution by coding the spatial relationships within images. Prior studies have investigat… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

  16. arXiv:2307.00430  [pdf, other

    cs.CV cs.AI

    WaveMixSR: A Resource-efficient Neural Network for Image Super-resolution

    Authors: Pranav Jeevan, Akella Srinidhi, Pasunuri Prathiba, Amit Sethi

    Abstract: Image super-resolution research recently been dominated by transformer models which need higher computational resources than CNNs due to the quadratic complexity of self-attention. We propose a new neural network -- WaveMixSR -- for image super-resolution based on WaveMix architecture which uses a 2D-discrete wavelet transform for spatial token-mixing. Unlike transformer-based models, WaveMixSR do… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 10 pages, 3 figures

    ACM Class: I.2.10; I.4.0; I.4.1; I.4.2; I.4.6; I.4.7; I.4.8; I.4.9; I.4.10; I.2.10; I.5.1; I.5.2; I.5.4; I.4.3; I.4.4; I.4.5

  17. arXiv:2307.00407  [pdf, other

    cs.CV cs.AI

    WavePaint: Resource-efficient Token-mixer for Self-supervised Inpainting

    Authors: Pranav Jeevan, Dharshan Sampath Kumar, Amit Sethi

    Abstract: Image inpainting, which refers to the synthesis of missing regions in an image, can help restore occluded or degraded areas and also serve as a precursor task for self-supervision. The current state-of-the-art models for image inpainting are computationally heavy as they are based on transformer or CNN backbones that are trained in adversarial or diffusion settings. This paper diverges from vision… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 11 pages, 7 figures

    ACM Class: I.2.10; I.4.0; I.4.4; I.4.3; I.4.5; I.4.1; I.4.2; I.4.6; I.4.7; I.4.8; I.4.9; I.4.10; I.2.10; I.5.1; I.5.2; I.5.4

  18. The ACROBAT 2022 Challenge: Automatic Registration Of Breast Cancer Tissue

    Authors: Philippe Weitz, Masi Valkonen, Leslie Solorzano, Circe Carr, Kimmo Kartasalo, Constance Boissin, Sonja Koivukoski, Aino Kuusela, Dusan Rasic, Yanbo Feng, Sandra Sinius Pouplier, Abhinav Sharma, Kajsa Ledesma Eriksson, Stephanie Robertson, Christian Marzahl, Chandler D. Gatenbee, Alexander R. A. Anderson, Marek Wodzinski, Artur Jurgas, Niccolò Marini, Manfredo Atzori, Henning Müller, Daniel Budelmann, Nick Weiss, Stefan Heldmann , et al. (16 additional authors not shown)

    Abstract: The alignment of tissue between histopathological whole-slide-images (WSI) is crucial for research and clinical applications. Advances in computing, deep learning, and availability of large WSI datasets have revolutionised WSI analysis. Therefore, the current state-of-the-art in WSI registration is unclear. To address this, we conducted the ACROBAT challenge, based on the largest WSI registration… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

  19. arXiv:2304.09623  [pdf, other

    cs.CV eess.IV

    CHATTY: Coupled Holistic Adversarial Transport Terms with Yield for Unsupervised Domain Adaptation

    Authors: Chirag P, Mukta Wagle, Ravi Kant Gupta, Pranav Jeevan, Amit Sethi

    Abstract: We propose a new technique called CHATTY: Coupled Holistic Adversarial Transport Terms with Yield for Unsupervised Domain Adaptation. Adversarial training is commonly used for learning domain-invariant representations by reversing the gradients from a domain discriminator head to train the feature extractor layers of a neural network. We propose significant modifications to the adversarial head, i… ▽ More

    Submitted 20 April, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 10 pages, 4 figures

    ACM Class: I.4.0; I.4.10; I.2.0; I.2.10

  20. arXiv:2303.09930  [pdf, other

    cs.CV

    Robust Semi-Supervised Learning for Histopathology Images through Self-Supervision Guided Out-of-Distribution Scoring

    Authors: Nikhil Cherian Kurian, Varsha S, Abhijit Patil, Shashikant Khade, Amit Sethi

    Abstract: Semi-supervised learning (semi-SL) is a promising alternative to supervised learning for medical image analysis when obtaining good quality supervision for medical imaging is difficult. However, semi-SL assumes that the underlying distribution of unaudited data matches that of the few labeled samples, which is often violated in practical settings, particularly in medical images. The presence of ou… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

  21. arXiv:2302.11488  [pdf, other

    eess.IV cs.CV

    Magnification Invariant Medical Image Analysis: A Comparison of Convolutional Networks, Vision Transformers, and Token Mixers

    Authors: Pranav Jeevan, Nikhil Cherian Kurian, Amit Sethi

    Abstract: Convolution Neural Networks (CNNs) are widely used in medical image analysis, but their performance degrade when the magnification of testing images differ from the training images. The inability of CNNs to generalize across magnification scales can result in sub-optimal performance on external datasets. This study aims to evaluate the robustness of various deep learning architectures in the analy… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: 6 pages, 3 figures

    ACM Class: I.2.1; I.4.0; I.4.8; I.4.9; I.4.10; I.5.1; I.5.2; I.5.4; I.5.5; J.3

  22. arXiv:2211.15667  [pdf, other

    q-bio.QM cs.CV eess.IV

    Artificial Intelligence-based Eosinophil Counting in Gastrointestinal Biopsies

    Authors: Harsh Shah, Thomas Jacob, Amruta Parulekar, Anjali Amarapurkar, Amit Sethi

    Abstract: Normally eosinophils are present in the gastrointestinal (GI) tract of healthy individuals. When the eosinophils increase beyond their usual amount in the GI tract, a patient gets varied symptoms. Clinicians find it difficult to diagnose this condition called eosinophilia. Early diagnosis can help in treating patients. Histopathology is the gold standard in the diagnosis for this condition. As thi… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 4 pages, 2 figures

  23. arXiv:2209.09193  [pdf, other

    cs.CV

    Improving Mitosis Detection Via UNet-based Adversarial Domain Homogenizer

    Authors: Tirupati Saketh Chandr, Sahar Almahfouz Nasser, Nikhil Cherian Kurian, Amit Sethi

    Abstract: The effective localization of mitosis is a critical precursory task for deciding tumor prognosis and grade. Automated mitosis detection through deep learning-oriented image analysis often fails on unseen patient data due to inherent domain biases. This paper proposes a domain homogenizer for mitosis detection that attempts to alleviate domain differences in histology images via adversarial reconst… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  24. arXiv:2208.12506  [pdf, other

    cs.CV cs.AI cs.LG

    EGFR Mutation Prediction of Lung Biopsy Images using Deep Learning

    Authors: Ravi Kant Gupta, Shivani Nandgaonkar, Nikhil Cherian Kurian, Swapnil Rane, Amit Sethi

    Abstract: The standard diagnostic procedures for targeted therapies in lung cancer treatment involve histological subtyping and subsequent detection of key driver mutations, such as EGFR. Even though molecular profiling can uncover the driver mutation, the process is often expensive and time-consuming. Deep learning-oriented image analysis offers a more economical alternative for discovering driver mutation… ▽ More

    Submitted 13 March, 2023; v1 submitted 26 August, 2022; originally announced August 2022.

    Comments: We need to improve

    ACM Class: I.4.0; I.4.6; I.4.10; J.3; I.2.10

  25. arXiv:2207.08492  [pdf, other

    eess.SY cs.RO

    Shallow Water Bathymetry Survey using an Autonomous Surface Vehicle

    Authors: Bibin Wilson, Anand Singh, Amit Sethi

    Abstract: Accurate and cost effective mapping of water bodies has an enormous significance for environmental understanding and navigation. However, the quantity and quality of information we acquire from such environmental features is limited by various factors, including cost, time, security, and the capabilities of existing data collection techniques. Measurement of water depth is an important part of suc… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  26. arXiv:2207.01811  [pdf, other

    physics.geo-ph cs.CV cs.LG eess.SP

    Deriving Surface Resistivity from Polarimetric SAR Data Using Dual-Input UNet

    Authors: Bibin Wilson, Rajiv Kumar, Narayanarao Bhogapurapu, Anand Singh, Amit Sethi

    Abstract: Traditional survey methods for finding surface resistivity are time-consuming and labor intensive. Very few studies have focused on finding the resistivity/conductivity using remote sensing data and deep learning techniques. In this line of work, we assessed the correlation between surface resistivity and Synthetic Aperture Radar (SAR) by applying various deep learning methods and tested our hypot… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  27. arXiv:2205.14375  [pdf, other

    cs.CV cs.AI cs.LG

    WaveMix: A Resource-efficient Neural Network for Image Analysis

    Authors: Pranav Jeevan, Kavitha Viswanathan, Anandu A S, Amit Sethi

    Abstract: We propose a novel neural architecture for computer vision -- WaveMix -- that is resource-efficient and yet generalizable and scalable. While using fewer trainable parameters, GPU RAM, and computations, WaveMix networks achieve comparable or better accuracy than the state-of-the-art convolutional neural networks, vision transformers, and token mixers for several tasks. This efficiency can translat… ▽ More

    Submitted 30 March, 2024; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: 20 pages, 5 figures

    ACM Class: I.2.10; I.4.0; I.4.1; I.4.2; I.4.6; I.4.7; I.4.8; I.4.9; I.4.10; I.2.10; I.5.1; I.5.2; I.5.4; J.2

  28. arXiv:2205.01777  [pdf, other

    eess.IV cs.CV

    Deep Multi-Scale U-Net Architecture and Label-Noise Robust Training Strategies for Histopathological Image Segmentation

    Authors: Nikhil Cherian Kurian, Amit Lohan, Gregory Verghese, Nimish Dharamshi, Swati Meena, Mengyuan Li, Fangfang Liu, Cheryl Gillet, Swapnil Rane, Anita Grigoriadis, Amit Sethi

    Abstract: Although the U-Net architecture has been extensively used for segmentation of medical images, we address two of its shortcomings in this work. Firstly, the accuracy of vanilla U-Net degrades when the target regions for segmentation exhibit significant variations in shape and size. Even though the U-Net already possesses some capability to analyze features at various scales, we propose to explicitl… ▽ More

    Submitted 13 August, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: 12 pages, 4 figures , 2 tables ,Added Attention UNet Results, Added Sinus and Germinal Center overlay images, Modified paper format, Fixed Title typos

  29. arXiv:2203.07114  [pdf, other

    eess.IV cs.CV

    WSSAMNet: Weakly Supervised Semantic Attentive Medical Image Registration Network

    Authors: Sahar Almahfouz Nasser, Nikhil Cherian Kurian, Saqib Shamsi, Mohit Meena, Amit Sethi

    Abstract: We present WSSAMNet, a weakly supervised method for medical image registration. Ours is a two step method, with the first step being the computation of segmentation masks of the fixed and moving volumes. These masks are then used to attend to the input volume, which are then provided as inputs to a registration network in the second step. The registration network computes the deformation field to… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

  30. arXiv:2203.03689  [pdf, other

    cs.CV cs.AI cs.LG

    WaveMix: Resource-efficient Token Mixing for Images

    Authors: Pranav Jeevan, Amit Sethi

    Abstract: Although certain vision transformer (ViT) and CNN architectures generalize well on vision tasks, it is often impractical to use them on green, edge, or desktop computing due to their computational requirements for training and even testing. We present WaveMix as an alternative neural architecture that uses a multi-scale 2D discrete wavelet transform (DWT) for spatial token mixing. Unlike ViTs, Wav… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 12 pages, 2 figures

    ACM Class: I.4.0; I.4.1; I.4.7; I.4.8; I.4.9; I.4.10; I.2.10; I.5.1; I.5.2; I.5.4

  31. arXiv:2201.10271  [pdf, other

    cs.CV cs.AI cs.LG

    Convolutional Xformers for Vision

    Authors: Pranav Jeevan, Amit sethi

    Abstract: Vision transformers (ViTs) have found only limited practical use in processing images, in spite of their state-of-the-art accuracy on certain benchmarks. The reason for their limited use include their need for larger training datasets and more computational resources compared to convolutional neural networks (CNNs), owing to the quadratic complexity of their self-attention mechanism. We propose a… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 9 pages, 3 figures

    ACM Class: I.4.0; I.4.1; I.4.7; I.4.8; I.4.9; I.4.10; I.2.10; I.5.1; I.5.2; I.5.4

  32. arXiv:2201.09314  [pdf

    eess.IV cs.CV cs.LG

    Perceptual cGAN for MRI Super-resolution

    Authors: Sahar Almahfouz Nasser, Saqib Shamsi, Valay Bundele, Bhavesh Garg, Amit Sethi

    Abstract: Capturing high-resolution magnetic resonance (MR) images is a time consuming process, which makes it unsuitable for medical emergencies and pediatric patients. Low-resolution MR imaging, by contrast, is faster than its high-resolution counterpart, but it compromises on fine details necessary for a more precise diagnosis. Super-resolution (SR), when applied to low-resolution MR images, can help inc… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

  33. arXiv:2112.06979  [pdf, other

    eess.IV cs.CV

    The Brain Tumor Sequence Registration (BraTS-Reg) Challenge: Establishing Correspondence Between Pre-Operative and Follow-up MRI Scans of Diffuse Glioma Patients

    Authors: Bhakti Baheti, Satrajit Chakrabarty, Hamed Akbari, Michel Bilello, Benedikt Wiestler, Julian Schwarting, Evan Calabrese, Jeffrey Rudie, Syed Abidi, Mina Mousa, Javier Villanueva-Meyer, Brandon K. K. Fields, Florian Kofler, Russell Takeshi Shinohara, Juan Eugenio Iglesias, Tony C. W. Mok, Albert C. S. Chung, Marek Wodzinski, Artur Jurgas, Niccolo Marini, Manfredo Atzori, Henning Muller, Christoph Grobroehmer, Hanna Siebert, Lasse Hansen , et al. (48 additional authors not shown)

    Abstract: Registration of longitudinal brain MRI scans containing pathologies is challenging due to dramatic changes in tissue appearance. Although there has been progress in developing general-purpose medical image registration techniques, they have not yet attained the requisite precision and reliability for this task, highlighting its inherent complexity. Here we describe the Brain Tumor Sequence Registr… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 December, 2021; originally announced December 2021.

  34. arXiv:2110.10969  [pdf, other

    cs.LG cs.CV cs.NE

    Memory Efficient Adaptive Attention For Multiple Domain Learning

    Authors: Himanshu Pradeep Aswani, Abhiraj Sunil Kanse, Shubhang Bhatnagar, Amit Sethi

    Abstract: Training CNNs from scratch on new domains typically demands large numbers of labeled images and computations, which is not suitable for low-power hardware. One way to reduce these requirements is to modularize the CNN architecture and freeze the weights of the heavier modules, that is, the lower layers after pre-training. Recent studies have proposed alternative modular architectures and schemes t… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 13 pages, 3 figures, 4 graphs, 3 tables

  35. arXiv:2107.02239  [pdf, other

    cs.CV cs.AI cs.CC cs.LG

    Vision Xformers: Efficient Attention for Image Classification

    Authors: Pranav Jeevan, Amit Sethi

    Abstract: Although transformers have become the neural architectures of choice for natural language processing, they require orders of magnitude more training data, GPU memory, and computations in order to compete with convolutional neural networks for computer vision. The attention mechanism of transformers scales quadratically with the length of the input sequence, and unrolled images have long sequence l… ▽ More

    Submitted 1 October, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: 11 pages, 4 figures

    ACM Class: I.4.0; I.4.1; I.4.7; I.4.8; I.4.9; I.4.10; I.2.10; I.5.1; I.5.2; I.5.4

  36. arXiv:2104.09088  [pdf, other

    cs.CL cs.LG

    Alexa Conversations: An Extensible Data-driven Approach for Building Task-oriented Dialogue Systems

    Authors: Anish Acharya, Suranjit Adhikari, Sanchit Agarwal, Vincent Auvray, Nehal Belgamwar, Arijit Biswas, Shubhra Chandra, Tagyoung Chung, Maryam Fazel-Zarandi, Raefer Gabriel, Shuyang Gao, Rahul Goel, Dilek Hakkani-Tur, Jan Jezabek, Abhay Jha, Jiun-Yu Kao, Prakash Krishnan, Peter Ku, Anuj Goyal, Chien-Wei Lin, Qing Liu, Arindam Mandal, Angeliki Metallinou, Vishal Naik, Yi Pan , et al. (6 additional authors not shown)

    Abstract: Traditional goal-oriented dialogue systems rely on various components such as natural language understanding, dialogue state tracking, policy learning and response generation. Training each component requires annotations which are hard to obtain for every new domain, limiting scalability of such systems. Similarly, rule-based dialogue systems require extensive writing and maintenance of rules and… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Journal ref: NAACL 2021 System Demonstrations Track

  37. arXiv:2011.15000  [pdf, other

    cs.CV cs.LG eess.IV

    Fast, Self Supervised, Fully Convolutional Color Normalization of H&E Stained Images

    Authors: Abhijeet Patil, Mohd. Talha, Aniket Bhatia, Nikhil Cherian Kurian, Sammed Mangale, Sunil Patel, Amit Sethi

    Abstract: Performance of deep learning algorithms decreases drastically if the data distributions of the training and testing sets are different. Due to variations in staining protocols, reagent brands, and habits of technicians, color variation in digital histopathology images is quite common. Color variation causes problems for the deployment of deep learning-based solutions for automatic diagnosis system… ▽ More

    Submitted 30 November, 2020; originally announced November 2020.

    Comments: --

  38. arXiv:2010.15947  [pdf, other

    cs.CV cs.LG

    PAL : Pretext-based Active Learning

    Authors: Shubhang Bhatnagar, Sachin Goyal, Darshan Tank, Amit Sethi

    Abstract: The goal of pool-based active learning is to judiciously select a fixed-sized subset of unlabeled samples from a pool to query an oracle for their labels, in order to maximize the accuracy of a supervised learner. However, the unsaid requirement that the oracle should always assign correct labels is unreasonable for most situations. We propose an active learning technique for deep neural networks… ▽ More

    Submitted 28 March, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

  39. arXiv:2009.07793  [pdf, other

    cs.LG stat.ML

    Activation Functions: Do They Represent A Trade-Off Between Modular Nature of Neural Networks And Task Performance

    Authors: Himanshu Pradeep Aswani, Amit Sethi

    Abstract: Current research suggests that the key factors in designing neural network architectures involve choosing number of filters for every convolution layer, number of hidden neurons for every fully connected layer, dropout and pruning. The default activation function in most cases is the ReLU, as it has empirically shown faster training convergence. We explore whether ReLU is the best choice if one is… ▽ More

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: 5 pages, 1 figure, 2 tables, pre-print

  40. arXiv:2009.06136  [pdf, other

    cs.GT

    Convergence Analysis of No-Regret Bidding Algorithms in Repeated Auctions

    Authors: Zhe Feng, Guru Guruganesh, Christopher Liaw, Aranyak Mehta, Abhishek Sethi

    Abstract: The connection between games and no-regret algorithms has been widely studied in the literature. A fundamental result is that when all players play no-regret strategies, this produces a sequence of actions whose time-average is a coarse-correlated equilibrium of the game. However, much less is known about equilibrium selection in the case that multiple equilibria exist. In this work, we study th… ▽ More

    Submitted 13 September, 2020; originally announced September 2020.

  41. arXiv:2008.09983  [pdf, other

    cs.LG cs.DB stat.ML

    Leveraging Organizational Resources to Adapt Models to New Data Modalities

    Authors: Sahaana Suri, Raghuveer Chanda, Neslihan Bulut, Pradyumna Narayana, Yemao Zeng, Peter Bailis, Sugato Basu, Girija Narlikar, Christopher Re, Abishek Sethi

    Abstract: As applications in large organizations evolve, the machine learning (ML) models that power them must adapt the same predictive tasks to newly arising data modalities (e.g., a new video content launch in a social media application requires existing text or image models to extend to video). To solve this problem, organizations typically create ML pipelines from scratch. However, this fails to utiliz… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Journal ref: PVLDB,13(12): 3396-3410, 2020

  42. arXiv:2008.03750  [pdf, other

    eess.IV cs.CV

    Switching Loss for Generalized Nucleus Detection in Histopathology

    Authors: Deepak Anand, Gaurav Patel, Yaman Dang, Amit Sethi

    Abstract: The accuracy of deep learning methods for two foundational tasks in medical image analysis -- detection and segmentation -- can suffer from class imbalance. We propose a `switching loss' function that adaptively shifts the emphasis between foreground and background classes. While the existing loss functions to address this problem were motivated by the classification task, the switching loss is ba… ▽ More

    Submitted 9 August, 2020; originally announced August 2020.

  43. arXiv:2006.09464  [pdf, other

    eess.IV cs.LG q-bio.QM

    Visualization for Histopathology Images using Graph Convolutional Neural Networks

    Authors: Mookund Sureka, Abhijeet Patil, Deepak Anand, Amit Sethi

    Abstract: With the increase in the use of deep learning for computer-aided diagnosis in medical images, the criticism of the black-box nature of the deep learning models is also on the rise. The medical community needs interpretable models for both due diligence and advancing the understanding of disease and treatment mechanisms. In histology, in particular, while there is rich detail available at the cellu… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 5 pages, 3 Figures

  44. arXiv:2005.11797  [pdf, other

    cs.LG cs.AI stat.ML

    Functional Space Variational Inference for Uncertainty Estimation in Computer Aided Diagnosis

    Authors: Pranav Poduval, Hrushikesh Loya, Amit Sethi

    Abstract: Deep neural networks have revolutionized medical image analysis and disease diagnosis. Despite their impressive performance, it is difficult to generate well-calibrated probabilistic outputs for such networks, which makes them uninterpretable black boxes. Bayesian neural networks provide a principled approach for modelling uncertainty and increasing patient safety, but they have a large computatio… ▽ More

    Submitted 28 May, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: Meaningful priors on the functional space rather than the weight space, result in well calibrated uncertainty estimates

    Report number: MIDL/2020/ExtendedAbstract/eLL-c_Xc0B

    Journal ref: Medical Imaging with Deep Learning 2020

  45. arXiv:2005.05513  [pdf, other

    cs.CL cs.CY cs.SI

    Psychometric Analysis and Coupling of Emotions Between State Bulletins and Twitter in India during COVID-19 Infodemic

    Authors: Baani Leen Kaur Jolly, Palash Aggrawal, Amogh Gulati, Amarjit Singh Sethi, Ponnurangam Kumaraguru, Tavpritesh Sethi

    Abstract: COVID-19 infodemic has been spreading faster than the pandemic itself. The misinformation riding upon the infodemic wave poses a major threat to people's health and governance systems. Since social media is the largest source of information, managing the infodemic not only requires mitigating of misinformation but also an early understanding of psychological patterns resulting from it. During the… ▽ More

    Submitted 13 May, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

  46. arXiv:2004.11430  [pdf, other

    cs.SI physics.soc-ph q-bio.PE

    Mobile phone location data reveal the effect and geographic variation of social distancing on the spread of the COVID-19 epidemic

    Authors: Song Gao, Jinmeng Rao, Yuhao Kang, Yunlei Liang, Jake Kruse, Doerte Doepfer, Ajay K. Sethi, Juan Francisco Mandujano Reyes, Jonathan Patz, Brian S. Yandell

    Abstract: The emergence of SARS-CoV-2 and the coronavirus infectious disease (COVID-19) has become a pandemic. Social (physical) distancing is a key non-pharmacologic control measure to reduce the transmission rate of SARS-COV-2, but high-level adherence is needed. Using daily travel distance and stay-at-home time derived from large-scale anonymous mobile phone location data provided by Descartes Labs and S… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: 17 pages, 4 figures, 1 table

    MSC Class: 65D10 ACM Class: H.4; G.3; J.2

    Journal ref: JAMA Network Open. 2020;3(9):e2020485

  47. arXiv:2004.02498  [pdf, other

    cs.CV q-bio.PE

    Image-based phenotyping of diverse Rice (Oryza Sativa L.) Genotypes

    Authors: Mukesh Kumar Vishal, Dipesh Tamboli, Abhijeet Patil, Rohit Saluja, Biplab Banerjee, Amit Sethi, Dhandapani Raju, Sudhir Kumar, R N Sahoo, Viswanathan Chinnusamy, J Adinarayana

    Abstract: Development of either drought-resistant or drought-tolerant varieties in rice (Oryza sativa L.), especially for high yield in the context of climate change, is a crucial task across the world. The need for high yielding rice varieties is a prime concern for developing nations like India, China, and other Asian-African countries where rice is a primary staple food. The present investigation is carr… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: Paper presented at the ICLR 2020 Workshop on Computer Vision for Agriculture (CV4A)

  48. arXiv:2003.08573  [pdf, other

    cs.LG stat.AP stat.ML

    Uncertainty Estimation in Cancer Survival Prediction

    Authors: Hrushikesh Loya, Pranav Poduval, Deepak Anand, Neeraj Kumar, Amit Sethi

    Abstract: Survival models are used in various fields, such as the development of cancer treatment protocols. Although many statistical and machine learning models have been proposed to achieve accurate survival predictions, little attention has been paid to obtain well-calibrated uncertainty estimates associated with each prediction. The currently popular models are opaque and untrustworthy in that they oft… ▽ More

    Submitted 25 March, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: 5 pages, Accepted at AI4AH Workshop at ICLR 2020

  49. arXiv:2003.00823  [pdf, other

    cs.CV cs.LG stat.ML

    Breast Cancer Histopathology Image Classification and Localization using Multiple Instance Learning

    Authors: Abhijeet Patil, Dipesh Tamboli, Swati Meena, Deepak Anand, Amit Sethi

    Abstract: Breast cancer has the highest mortality among cancers in women. Computer-aided pathology to analyze microscopic histopathology images for diagnosis with an increasing number of breast cancer patients can bring the cost and delays of diagnosis down. Deep learning in histopathology has attracted attention over the last decade of achieving state-of-the-art performance in classification and localizati… ▽ More

    Submitted 16 February, 2020; originally announced March 2020.

    Comments: Accepted in 2019 5th IEEE International WIE Conference on Electrical and Computer Engineering (WIECON-ECE) and Awarded as best paper

  50. arXiv:1911.07309  [pdf, other

    cs.LG stat.ML

    Coverage Testing of Deep Learning Models using Dataset Characterization

    Authors: Senthil Mani, Anush Sankaran, Srikanth Tamilselvam, Akshay Sethi

    Abstract: Deep Neural Networks (DNNs), with its promising performance, are being increasingly used in safety critical applications such as autonomous driving, cancer detection, and secure authentication. With growing importance in deep learning, there is a requirement for a more standardized framework to evaluate and test deep learning models. The primary challenge involved in automated generation of extens… ▽ More

    Submitted 17 November, 2019; originally announced November 2019.