Showing 1–24 of 24 results for author: Srinivas, A

  1. arXiv:2312.07352  [pdf, other]

    cs.CV cs.AI

    CholecTrack20: A Dataset for Multi-Class Multiple Tool Tracking in Laparoscopic Surgery

    Authors: Chinedu Innocent Nwoye, Kareem Elgohary, Anvita Srinivas, Fauzan Zaid, Joël L. Lavanchy, Nicolas Padoy

    Abstract: Tool tracking in surgical videos is vital in computer-assisted intervention for tasks like surgeon skill assessment, safety zone estimation, and human-machine collaboration during minimally invasive procedures. The lack of large-scale datasets hampers Artificial Intelligence implementation in this domain. Current datasets exhibit overly generic tracking formalization, often lacking surgical contex…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Surgical tool tracking dataset paper, 15 pages, 9 figures, 4 tables

  2. arXiv:2301.06294  [pdf, other]

    cs.AI cs.LG cs.SC

    Neuro-Symbolic World Models for Adapting to Open World Novelty

    Authors: Jonathan Balloch, Zhiyu Lin, Robert Wright, Xiangyu Peng, Mustafa Hussain, Aarun Srinivas, Julia Kim, Mark O. Riedl

    Abstract: Open-world novelty--a sudden change in the mechanics or properties of an environment--is a common occurrence in the real world. Novelty adaptation is an agent's ability to improve its policy performance post-novelty. Most reinforcement learning (RL) methods assume that the world is a closed, fixed process. Consequentially, RL policies adapt inefficiently to novelties. To address this, we introduce…

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: 9 pages, 8 figures, Extended Abstract accepted for presentation at AAMAS 2023

  3. arXiv:2203.12117  [pdf, other]

    cs.AI cs.LG

    NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty

    Authors: Jonathan Balloch, Zhiyu Lin, Mustafa Hussain, Aarun Srinivas, Robert Wright, Xiangyu Peng, Julia Kim, Mark Riedl

    Abstract: A robust body of reinforcement learning techniques have been developed to solve complex sequential decision making problems. However, these methods assume that train and evaluation tasks come from similarly or identically distributed environments. This assumption does not hold in real life where small novel changes to the environment can make a previously learned policy fail or introduce simpler s…

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: 7 pages, 4 figures, AAAI Spring Symposium 2022 on Designing Artificial Intelligence for Open Worlds (Long Oral)

  4. arXiv:2106.01345  [pdf, other]

    cs.LG cs.AI

    Decision Transformer: Reinforcement Learning via Sequence Modeling

    Authors: Lili Chen, Kevin Lu, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas, Igor Mordatch

    Abstract: We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence modeling problem. This allows us to draw upon the simplicity and scalability of the Transformer architecture, and associated advances in language modeling such as GPT-x and BERT. In particular, we present Decision Transformer, an architecture that casts the problem of RL as conditional sequence modeling. Unlike prior…

    Submitted 24 June, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: First two authors contributed equally. Last two authors advised equally

  5. arXiv:2104.10157  [pdf, other]

    cs.CV cs.LG

    VideoGPT: Video Generation using VQ-VAE and Transformers

    Authors: Wilson Yan, Yunzhi Zhang, Pieter Abbeel, Aravind Srinivas

    Abstract: We present VideoGPT: a conceptually simple architecture for scaling likelihood based generative modeling to natural videos. VideoGPT uses VQ-VAE that learns downsampled discrete latent representations of a raw video by employing 3D convolutions and axial self-attention. A simple GPT-like architecture is then used to autoregressively model the discrete latents using spatio-temporal position encodin…

    Submitted 14 September, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: Project website: https://wilson1yan.github.io/videogpt/index.html

  6. arXiv:2104.00774  [pdf]

    cs.RO

    Task-Invariant Learning of Continuous Joint Kinematics during Steady-State and Transient Ambulation Using Ultrasound Sensing

    Authors: M. Hassan Jahanandish, Kaitlin G. Rabe, Abhishek Srinivas, Nicholas P. Fey, Kenneth Hoyt

    Abstract: Natural control of limb motion is continuous and progressively adaptive to individual intent. While intuitive interfaces have the potential to rely on the neuromuscular input by the user for continuous adaptation, continuous volitional control of assistive devices that can generalize across various tasks has not been addressed. In this study, we propose a method to use spatiotemporal ultrasound fe…

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: Accepted for presentation at 2021 IEEE International Conference on Robotics and Automation (ICRA 2021)

  7. arXiv:2103.12731  [pdf, other]

    cs.CV

    Scaling Local Self-Attention for Parameter Efficient Visual Backbones

    Authors: Ashish Vaswani, Prajit Ramachandran, Aravind Srinivas, Niki Parmar, Blake Hechtman, Jonathon Shlens

    Abstract: Self-attention has the promise of improving computer vision systems due to parameter-independent scaling of receptive fields and content-dependent interactions, in contrast to parameter-dependent scaling and content-independent interactions of convolutions. Self-attention models have recently been shown to have encouraging improvements on accuracy-parameter trade-offs compared to baseline convolut…

    Submitted 7 June, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: CVPR 2021 Oral

  8. arXiv:2103.07579  [pdf, other]

    cs.CV

    Revisiting ResNets: Improved Training and Scaling Strategies

    Authors: Irwan Bello, William Fedus, Xianzhi Du, Ekin D. Cubuk, Aravind Srinivas, Tsung-Yi Lin, Jonathon Shlens, Barret Zoph

    Abstract: Novel computer vision architectures monopolize the spotlight, but the impact of the model architecture is often conflated with simultaneous changes to training methodology and scaling strategies. Our work revisits the canonical ResNet (He et al., 2015) and studies these three aspects in an effort to disentangle them. Perhaps surprisingly, we find that training and scaling strategies may matter mor…

    Submitted 12 March, 2021; originally announced March 2021.

  9. arXiv:2103.02886  [pdf, other]

    cs.LG cs.CV

    Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings

    Authors: Lili Chen, Kimin Lee, Aravind Srinivas, Pieter Abbeel

    Abstract: Recent advances in off-policy deep reinforcement learning (RL) have led to impressive success in complex tasks from visual observations. Experience replay improves sample-efficiency by reusing experiences from the past, and convolutional neural networks (CNNs) process high-dimensional inputs effectively. However, such techniques demand high memory and computational bandwidth. In this paper, we pre…

    Submitted 28 October, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

  10. arXiv:2101.11605  [pdf, other]

    cs.CV cs.AI cs.LG

    Bottleneck Transformers for Visual Recognition

    Authors: Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani

    Abstract: We present BoTNet, a conceptually simple yet powerful backbone architecture that incorporates self-attention for multiple computer vision tasks including image classification, object detection and instance segmentation. By just replacing the spatial convolutions with global self-attention in the final three bottleneck blocks of a ResNet and no other changes, our approach improves upon the baseline…

    Submitted 2 August, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Technical Report, 20 pages, 13 figures, 19 tables

  11. arXiv:2101.01857  [pdf, other]

    cs.LG

    Reinforcement Learning with Latent Flow

    Authors: Wenling Shang, Xiaofei Wang, Aravind Srinivas, Aravind Rajeswaran, Yang Gao, Pieter Abbeel, Michael Laskin

    Abstract: Temporal information is essential to learning effective policies with Reinforcement Learning (RL). However, current state-of-the-art RL algorithms either assume that such information is given as part of the state space or, when learning from pixels, use the simple heuristic of frame-stacking to implicitly capture temporal information present in the image observations. This heuristic is in contrast…

    Submitted 5 January, 2021; originally announced January 2021.

  12. arXiv:2012.07177  [pdf, other]

    cs.CV

    Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

    Authors: Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph

    Abstract: Building instance segmentation models that are data-efficient and can handle rare object categories is an important challenge in computer vision. Leveraging data augmentations is a promising direction towards addressing this challenge. Here, we perform a systematic study of the Copy-Paste augmentation ([13, 12]) for instance segmentation where we randomly paste objects onto an image. Prior studies…

    Submitted 23 June, 2021; v1 submitted 13 December, 2020; originally announced December 2020.

    Comments: Accepted at CVPR 2021 (Oral)

  13. arXiv:2010.09163  [pdf, other]

    cs.LG

    D2RL: Deep Dense Architectures in Reinforcement Learning

    Authors: Samarth Sinha, Homanga Bharadhwaj, Aravind Srinivas, Animesh Garg

    Abstract: While improvements in deep learning architectures have played a crucial role in improving the state of supervised and unsupervised learning in computer vision and natural language processing, neural network architecture choices for reinforcement learning remain relatively under-explored. We take inspiration from successful architectural choices in computer vision and generative modelling, and inve…

    Submitted 27 November, 2020; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020 Deep RL Workshop. Preprint. Under review

  14. arXiv:2009.07724  [pdf, other]

    cs.CV cs.LG

    SelfAugment: Automatic Augmentation Policies for Self-Supervised Learning

    Authors: Colorado J Reed, Sean Metzger, Aravind Srinivas, Trevor Darrell, Kurt Keutzer

    Abstract: A common practice in unsupervised representation learning is to use labeled data to evaluate the quality of the learned representations. This supervised evaluation is then used to guide critical aspects of the training process such as selecting the data augmentation policy. However, guiding an unsupervised training process through supervised evaluations is not possible for real-world data that doe…

    Submitted 17 May, 2021; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: Computer Vision and Pattern Recognition (CVPR), 2021

  15. arXiv:2007.04938  [pdf, other]

    cs.LG cs.AI stat.ML

    SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

    Authors: Kimin Lee, Michael Laskin, Aravind Srinivas, Pieter Abbeel

    Abstract: Off-policy deep reinforcement learning (RL) has been successful in a range of challenging domains. However, standard off-policy RL algorithms can suffer from several issues, such as instability in Q-learning and balancing exploration and exploitation. To mitigate these issues, we present SUNRISE, a simple unified ensemble method, which is compatible with various off-policy RL algorithms. SUNRISE i…

    Submitted 11 June, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: ICML 2021 camera ready

  16. arXiv:2004.14990  [pdf, other]

    cs.LG stat.ML

    Reinforcement Learning with Augmented Data

    Authors: Michael Laskin, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel, Aravind Srinivas

    Abstract: Learning from visual observations is a fundamental yet challenging problem in Reinforcement Learning (RL). Although algorithmic advances combined with convolutional neural networks have proved to be a recipe for success, current methods are still lacking on two fronts: (a) data-efficiency of learning and (b) generalization to new environments. To this end, we present Reinforcement Learning with Au…

    Submitted 5 November, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: NeurIPS 2020 camera-ready version. First two authors contributed equally. Website: https://mishalaskin.github.io/rad Code: https://github.com/MishaLaskin/rad and https://github.com/pokaxpoka/rad_procgen

  17. arXiv:2004.04136  [pdf, other]

    cs.LG cs.CV stat.ML

    CURL: Contrastive Unsupervised Representations for Reinforcement Learning

    Authors: Aravind Srinivas, Michael Laskin, Pieter Abbeel

    Abstract: We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 1.9x and 1.2x performan…

    Submitted 21 September, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: First two authors contributed equally. Website: https://mishalaskin.github.io/curl Code: https://github.com/MishaLaskin/curl

  18. arXiv:1905.09272  [pdf, other]

    cs.CV cs.LG

    Data-Efficient Image Recognition with Contrastive Predictive Coding

    Authors: Olivier J. Hénaff, Aravind Srinivas, Jeffrey De Fauw, Ali Razavi, Carl Doersch, S. M. Ali Eslami, Aaron van den Oord

    Abstract: Human observers can learn to recognize new categories of images from a handful of examples, yet doing so with artificial ones remains an open challenge. We hypothesize that data-efficient recognition is enabled by representations which make the variability in natural signals more predictable. We therefore revisit and improve Contrastive Predictive Coding, an unsupervised objective for learning suc…

    Submitted 1 July, 2020; v1 submitted 22 May, 2019; originally announced May 2019.

  19. arXiv:1902.00275  [pdf, other]

    cs.LG cs.NE stat.ML

    Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design

    Authors: Jonathan Ho, Xi Chen, Aravind Srinivas, Yan Duan, Pieter Abbeel

    Abstract: Flow-based generative models are powerful exact likelihood models with efficient sampling and inference. Despite their computational efficiency, flow-based models generally have much worse density modeling performance compared to state-of-the-art autoregressive models. In this paper, we investigate and improve upon three limiting design choices employed by flow-based models in prior work: the use…

    Submitted 15 May, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: Accepted at ICML 2019

  20. arXiv:1804.00645  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Universal Planning Networks

    Authors: Aravind Srinivas, Allan Jabri, Pieter Abbeel, Sergey Levine, Chelsea Finn

    Abstract: A key challenge in complex visuomotor control is learning abstract representations that are effective for specifying goals, planning, and generalization. To this end, we introduce universal planning networks (UPN). UPNs embed differentiable planning within a goal-directed policy. This planning computation unrolls a forward model in a latent space and infers an optimal action plan through gradient…

    Submitted 4 April, 2018; v1 submitted 2 April, 2018; originally announced April 2018.

    Comments: Videos available at https://sites.google.com/view/upn-public/home

  21. arXiv:1702.06054  [pdf, other]

    cs.LG cs.AI cs.NE

    Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning

    Authors: Sahil Sharma, Aravind Srinivas, Balaraman Ravindran

    Abstract: Reinforcement Learning algorithms can learn complex behavioral patterns for sequential decision making tasks wherein an agent interacts with an environment and acquires feedback in the form of rewards sampled from it. Traditionally, such algorithms make decisions, i.e., select actions to execute, at every single time step of the agent-environment interactions. In this paper, we propose a novel fra…

    Submitted 21 September, 2020; v1 submitted 20 February, 2017; originally announced February 2017.

    Comments: 24 pages

  22. arXiv:1605.05365  [pdf, other]

    cs.LG cs.AI cs.NE

    Dynamic Frame skip Deep Q Network

    Authors: Aravind Srinivas, Sahil Sharma, Balaraman Ravindran

    Abstract: Deep Reinforcement Learning methods have achieved state of the art performance in learning control policies for the games in the Atari 2600 domain. One of the important parameters in the Arcade Learning Environment (ALE) is the frame skip rate. It decides the granularity at which agents can control game play. A frame skip value of $k$ allows the agent to repeat a selected action $k$ number of time…

    Submitted 21 September, 2020; v1 submitted 17 May, 2016; originally announced May 2016.

    Comments: IJCAI 2016 Workshop on Deep Reinforcement Learning: Frontiers and Challenges; 6 pages, 8 figures

  23. arXiv:1605.05359  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE

    Option Discovery in Hierarchical Reinforcement Learning using Spatio-Temporal Clustering

    Authors: Aravind Srinivas, Ramnandan Krishnamurthy, Peeyush Kumar, Balaraman Ravindran

    Abstract: This paper introduces an automated skill acquisition framework in reinforcement learning which involves identifying a hierarchical description of the given task in terms of abstract states and extended actions between abstract states. Identifying such structures present in the task provides ways to simplify and speed up reinforcement learning algorithms. These structures also help to generalize su…

    Submitted 21 September, 2020; v1 submitted 17 May, 2016; originally announced May 2016.

    Comments: Revised version of ICML 16 Abstraction in Reinforcement Learning workshop paper

  24. arXiv:1510.02879  [pdf, other]

    cs.AI cs.LG

    Attend, Adapt and Transfer: Attentive Deep Architecture for Adaptive Transfer from multiple sources in the same domain

    Authors: Janarthanan Rajendran, Aravind Srinivas, Mitesh M. Khapra, P Prasanna, Balaraman Ravindran

    Abstract: Transferring knowledge from prior source tasks in solving a new target task can be useful in several learning applications. The application of transfer poses two serious challenges which have not been adequately addressed. First, the agent should be able to avoid negative transfer, which happens when the transfer hampers or slows down the learning instead of helping it. Second, the agent should be…

    Submitted 21 September, 2020; v1 submitted 10 October, 2015; originally announced October 2015.

    Comments: Published as a conference paper at ICLR 2017