Skip to main content

Showing 1–2 of 2 results for author: Belagali, V

  1. arXiv:2307.01849  [pdf, other

    cs.RO cs.CV cs.LG

    Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning

    Authors: Xiang Li, Varun Belagali, Jinghuan Shang, Michael S. Ryoo

    Abstract: Sequence modeling approaches have shown promising results in robot imitation learning. Recently, diffusion models have been adopted for behavioral cloning in a sequence modeling fashion, benefiting from their exceptional capabilities in modeling complex data distributions. The standard diffusion-based policy iteratively generates action sequences from random noise conditioned on the input states.… ▽ More

    Submitted 11 January, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 15 pages, 13 figures. Code, pretrained checkpoints, and datasets are available at https://github.com/LostXine/crossway_diffusion Video demo is at https://youtu.be/9deKHueZBuk

  2. arXiv:2203.06004  [pdf, other

    cs.CV eess.AS

    An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production

    Authors: Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh

    Abstract: The best performance in Air-tissue boundary (ATB) segmentation of real-time Magnetic Resonance Imaging (rtMRI) videos in speech production is known to be achieved by a 3-dimensional convolutional neural network (3D-CNN) model. However, the evaluation of this model, as well as other ATB segmentation techniques reported in the literature, is done using Dynamic Time Warping (DTW) distance between the… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: accepted for ICASSP 2022