Showing 1–7 of 7 results for author: Venkataramanan, S

  1. arXiv:2311.05538  [pdf, other]

    cs.LG cs.CV

    Embedding Space Interpolation Beyond Mini-Batch, Beyond Pairs and Beyond Examples

    Authors: Shashanka Venkataramanan, Ewa Kijak, Laurent Amsaleg, Yannis Avrithis

    Abstract: Mixup refers to interpolation-based data augmentation, originally motivated as a way to go beyond empirical risk minimization (ERM). Its extensions mostly focus on the definition of interpolation and the space (input or feature) where it takes place, while the augmentation process itself is less studied. In most methods, the number of generated examples is limited to the mini-batch size and the nu…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023. arXiv admin note: substantial text overlap with arXiv:2206.14868

  2. arXiv:2310.08584  [pdf, other]

    cs.CV

    Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video

    Authors: Shashanka Venkataramanan, Mamshad Nayeem Rizve, João Carreira, Yuki M. Asano, Yannis Avrithis

    Abstract: Self-supervised learning has unlocked the potential of scaling up pretraining to billions of images, since annotation is unnecessary. But are we making the best use of data? How much more economical can we be? In this work, we attempt to answer this question by making two contributions. First, we investigate first-person videos and introduce a "Walking Tours" dataset. These videos are high-resolution,…

    Submitted 23 May, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024 (Best paper honorable mention). Project Page: https://shashankvkt.github.io/dora

  3. arXiv:2301.02240  [pdf, other]

    cs.CV

    Skip-Attention: Improving Vision Transformers by Paying Less Attention

    Authors: Shashanka Venkataramanan, Amir Ghodrati, Yuki M. Asano, Fatih Porikli, Amirhossein Habibian

    Abstract: This work aims to improve the efficiency of vision transformers (ViT). While ViTs use computationally expensive self-attention operations in every layer, we identify that these operations are highly correlated across layers -- a key redundancy that causes unnecessary computations. Based on this observation, we propose SkipAt, a method to reuse self-attention computation from preceding layers to ap…

    Submitted 17 January, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

  4. arXiv:2206.14868  [pdf, other]

    cs.LG cs.CV

    Teach me how to Interpolate a Myriad of Embeddings

    Authors: Shashanka Venkataramanan, Ewa Kijak, Laurent Amsaleg, Yannis Avrithis

    Abstract: Mixup refers to interpolation-based data augmentation, originally motivated as a way to go beyond empirical risk minimization (ERM). Yet, its extensions focus on the definition of interpolation and the space where it takes place, while the augmentation itself is less studied: For a mini-batch of size $m$, most methods interpolate between $m$ pairs with a single scalar interpolation factor $λ$. I…

    Submitted 29 June, 2022; originally announced June 2022.

  5. arXiv:2106.04990  [pdf, other]

    cs.LG cs.CV

    It Takes Two to Tango: Mixup for Deep Metric Learning

    Authors: Shashanka Venkataramanan, Bill Psomas, Ewa Kijak, Laurent Amsaleg, Konstantinos Karantzalos, Yannis Avrithis

    Abstract: Metric learning involves learning a discriminative representation such that embeddings of similar classes are encouraged to be close, while embeddings of dissimilar classes are pushed far apart. State-of-the-art methods focus mostly on sophisticated loss functions or mining strategies. On the one hand, metric learning losses consider two or more examples at a time. On the other hand, modern data a…

    Submitted 28 February, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR 2022

  6. arXiv:2103.15375  [pdf, other]

    cs.CV

    AlignMixup: Improving Representations By Interpolating Aligned Features

    Authors: Shashanka Venkataramanan, Ewa Kijak, Laurent Amsaleg, Yannis Avrithis

    Abstract: Mixup is a powerful data augmentation method that interpolates between two or more examples in the input or feature space and between the corresponding target labels. Many recent mixup methods focus on cutting and pasting two or more objects into one image, which is more about efficient processing than interpolation. However, how to best interpolate images is not well defined. In this sense, mixup…

    Submitted 25 March, 2022; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2022

  7. arXiv:1911.08616  [pdf, other]

    cs.CV eess.IV

    Attention Guided Anomaly Localization in Images

    Authors: Shashanka Venkataramanan, Kuan-Chuan Peng, Rajat Vikram Singh, Abhijit Mahalanobis

    Abstract: Anomaly localization is an important problem in computer vision which involves localizing anomalous regions within images with applications in industrial inspection, surveillance, and medical imaging. This task is challenging due to the small sample size and pixel coverage of the anomaly in real-world scenarios. Most prior works need to use anomalous training images to compute a class-specific thr…

    Submitted 16 July, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted to ECCV 2020