Skip to main content

Showing 1–10 of 10 results for author: Hampali, S

  1. arXiv:2406.09598  [pdf, other

    cs.CV

    Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking

    Authors: Prithviraj Banerjee, Sindi Shkodrani, Pierre Moulon, Shreyas Hampali, Fan Zhang, Jade Fountain, Edward Miller, Selen Basol, Richard Newcombe, Robert Wang, Jakob Julian Engel, Tomas Hodan

    Abstract: We introduce HOT3D, a publicly available dataset for egocentric hand and object tracking in 3D. The dataset offers over 833 minutes (more than 3.7M images) of multi-view RGB/monochrome image streams showing 19 subjects interacting with 33 diverse rigid objects, multi-modal signals such as eye gaze or scene point clouds, as well as comprehensive ground truth annotations including 3D poses of object… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2403.18080  [pdf, other

    cs.CV

    EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation

    Authors: Chenhongyi Yang, Anastasia Tkach, Shreyas Hampali, Linguang Zhang, Elliot J. Crowley, Cem Keskin

    Abstract: We present EgoPoseFormer, a simple yet effective transformer-based model for stereo egocentric human pose estimation. The main challenge in egocentric pose estimation is overcoming joint invisibility, which is caused by self-occlusion or a limited field of view (FOV) of head-mounted cameras. Our approach overcomes this challenge by incorporating a two-stage pose estimation paradigm: in the first s… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Tech Report

  3. arXiv:2403.17827  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions

    Authors: Sammy Christen, Shreyas Hampali, Fadime Sener, Edoardo Remelli, Tomas Hodan, Eric Sauser, Shugao Ma, Bugra Tekin

    Abstract: Generating natural hand-object interactions in 3D is challenging as the resulting hand and object motions are expected to be physically plausible and semantically meaningful. Furthermore, generalization to unseen objects is hindered by the limited scale of available hand-object interaction datasets. We propose DiffH2O, a novel method to synthesize realistic, one or two-handed object interactions f… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Project Page: https://diffh2o.github.io/

  4. arXiv:2211.16193  [pdf, other

    cs.CV

    In-Hand 3D Object Scanning from an RGB Sequence

    Authors: Shreyas Hampali, Tomas Hodan, Luan Tran, Lingni Ma, Cem Keskin, Vincent Lepetit

    Abstract: We propose a method for in-hand 3D scanning of an unknown object with a monocular camera. Our method relies on a neural implicit surface representation that captures both the geometry and the appearance of the object, however, by contrast with most NeRF-based methods, we do not assume that the camera-object relative poses are known. Instead, we simultaneously optimize both the object shape and the… ▽ More

    Submitted 22 June, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: CVPR 2023

  5. arXiv:2107.00887  [pdf, other

    cs.CV cs.HC

    HO-3D_v3: Improving the Accuracy of Hand-Object Annotations of the HO-3D Dataset

    Authors: Shreyas Hampali, Sayan Deb Sarkar, Vincent Lepetit

    Abstract: HO-3D is a dataset providing image sequences of various hand-object interaction scenarios annotated with the 3D pose of the hand and the object and was originally introduced as HO-3D_v2. The annotations were obtained automatically using an optimization method, 'HOnnotate', introduced in the original paper. HO-3D_v3 provides more accurate annotations for both the hand and object poses thus resultin… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  6. arXiv:2104.14639  [pdf, other

    cs.CV

    Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation

    Authors: Shreyas Hampali, Sayan Deb Sarkar, Mahdi Rad, Vincent Lepetit

    Abstract: We propose a robust and accurate method for estimating the 3D poses of two hands in close interaction from a single color image. This is a very challenging problem, as large occlusions and many confusions between the joints may happen. State-of-the-art methods solve this problem by regressing a heatmap for each joint, which requires solving two problems simultaneously: localizing the joints and re… ▽ More

    Submitted 19 April, 2022; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: Accepted at CVPR2022

  7. arXiv:2103.07969  [pdf, other

    cs.CV cs.AI cs.LG

    Monte Carlo Scene Search for 3D Scene Understanding

    Authors: Shreyas Hampali, Sinisa Stekovic, Sayan Deb Sarkar, Chetan Srinivasa Kumar, Friedrich Fraundorfer, Vincent Lepetit

    Abstract: We explore how a general AI algorithm can be used for 3D scene understanding to reduce the need for training data. More exactly, we propose a modification of the Monte Carlo Tree Search (MCTS) algorithm to retrieve objects and room layouts from noisy RGB-D scans. While MCTS was developed as a game-playing algorithm, we show it can also be used for complex perception problems. Our adapted MCTS algo… ▽ More

    Submitted 5 May, 2021; v1 submitted 14 March, 2021; originally announced March 2021.

    Comments: To be presented at CVPR 2021

  8. arXiv:2003.13764  [pdf, other

    cs.CV

    Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

    Authors: Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou , et al. (10 additional authors not shown)

    Abstract: We study how well different types of approaches generalise in the task of 3D hand pose estimation under single hand scenarios and hand-object interaction. We show that the accuracy of state-of-the-art methods can drop, and that they fail mostly on poses absent from the training set. Unfortunately, since the space of hand poses is highly dimensional, it is inherently not feasible to cover the whole… ▽ More

    Submitted 10 September, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: European Conference on Computer Vision (ECCV), 2020

  9. arXiv:2001.02149  [pdf, other

    cs.CV

    General 3D Room Layout from a Single View by Render-and-Compare

    Authors: Sinisa Stekovic, Shreyas Hampali, Mahdi Rad, Sayan Deb Sarkar, Friedrich Fraundorfer, Vincent Lepetit

    Abstract: We present a novel method to reconstruct the 3D layout of a room (walls, floors, ceilings) from a single perspective view in challenging conditions, by contrast with previous single-view methods restricted to cuboid-shaped layouts. This input view can consist of a color image only, but considering a depth map results in a more accurate reconstruction. Our approach is formalized as solving a constr… ▽ More

    Submitted 21 July, 2020; v1 submitted 7 January, 2020; originally announced January 2020.

  10. arXiv:1907.01481  [pdf, other

    cs.CV

    HOnnotate: A method for 3D Annotation of Hand and Object Poses

    Authors: Shreyas Hampali, Mahdi Rad, Markus Oberweger, Vincent Lepetit

    Abstract: We propose a method for annotating images of a hand manipulating an object with the 3D poses of both the hand and the object, together with a dataset created using this method. Our motivation is the current lack of annotated real images for this problem, as estimating the 3D poses is challenging, mostly because of the mutual occlusions between the hand and the object. To tackle this challenge, we… ▽ More

    Submitted 30 May, 2020; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: Accepted to CVPR2020