Showing 1–19 of 19 results for author: Rajasegaran, J

  1. arXiv:2404.09991  [pdf, other]

    cs.RO cs.CV

    EgoPet: Egomotion and Interaction Data from an Animal's Perspective

    Authors: Amir Bar, Arya Bakhtiar, Danny Tran, Antonio Loquercio, Jathushan Rajasegaran, Yann LeCun, Amir Globerson, Trevor Darrell

    Abstract: Animals perceive the world to plan their actions and interact with other agents to accomplish complex tasks, demonstrating capabilities that are still unmatched by AI systems. To advance our understanding and reduce the gap between the capabilities of animals and AI systems, we introduce a dataset of pet egomotion imagery with diverse examples of simultaneous egomotion and multi-agent interaction.…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: https://www.amirbar.net/egopet

  2. arXiv:2402.19469  [pdf, other]

    cs.RO cs.CV cs.LG

    Humanoid Locomotion as Next Token Prediction

    Authors: Ilija Radosavovic, Bike Zhang, Baifeng Shi, Jathushan Rajasegaran, Sarthak Kamat, Trevor Darrell, Koushil Sreenath, Jitendra Malik

    Abstract: We cast real-world humanoid control as a next token prediction problem, akin to predicting the next word in language. Our model is a causal transformer trained via autoregressive prediction of sensorimotor trajectories. To account for the multi-modal nature of the data, we perform prediction in a modality-aligned way, and for each input token predict the next token from the same modality. This gen…

    Submitted 29 February, 2024; originally announced February 2024.

  3. arXiv:2401.10889  [pdf, other]

    cs.CV cs.AI

    Synthesizing Moving People with 3D Control

    Authors: Boyi Li, Jathushan Rajasegaran, Yossi Gandelsman, Alexei A. Efros, Jitendra Malik

    Abstract: In this paper, we present a diffusion model-based framework for animating people from a single image for a given target 3D motion sequence. Our approach has two core components: a) learning priors about invisible parts of the human body and clothing, and b) rendering novel body poses with proper clothing and texture. For the first part, we learn an in-filling diffusion model to hallucinate unseen…

    Submitted 19 January, 2024; originally announced January 2024.

  4. arXiv:2305.20091  [pdf, other]

    cs.CV

    Humans in 4D: Reconstructing and Tracking Humans with Transformers

    Authors: Shubham Goel, Georgios Pavlakos, Jathushan Rajasegaran, Angjoo Kanazawa, Jitendra Malik

    Abstract: We present an approach to reconstruct humans and track them over time. At the core of our approach, we propose a fully "transformerized" version of a network for human mesh recovery. This network, HMR 2.0, advances the state of the art and shows the capability to analyze unusual poses that have in the past been difficult to reconstruct from single images. To analyze video, we use 3D reconstruction…

    Submitted 31 August, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: In ICCV 2023. Project Webpage: https://shubham-goel.github.io/4dhumans/

  5. arXiv:2304.01199  [pdf, other]

    cs.CV

    On the Benefits of 3D Pose and Tracking for Human Action Recognition

    Authors: Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Christoph Feichtenhofer, Jitendra Malik

    Abstract: In this work we study the benefits of using tracking and 3D poses for action recognition. To achieve this, we take the Lagrangian view on analysing actions over a trajectory of human motion rather than at a fixed point in space. Taking this stand allows us to use the tracklets of people to predict their actions. In this spirit, first we show the benefits of using 3D pose to infer actions, and stud…

    Submitted 7 August, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: CVPR2023 (project page: https://brjathu.github.io/LART)

  6. arXiv:2202.00263  [pdf, other]

    cs.LG cs.CV

    Fully Online Meta-Learning Without Task Boundaries

    Authors: Jathushan Rajasegaran, Chelsea Finn, Sergey Levine

    Abstract: While deep networks can learn complex functions such as classifiers, detectors, and trackers, many applications require models that continually adapt to changing input distributions, changing tasks, and changing environmental conditions. Indeed, this ability to continuously accrue knowledge and use past experience to learn new tasks quickly in continual settings is one of the key properties of an…

    Submitted 17 February, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

  7. arXiv:2112.04477  [pdf, other]

    cs.CV

    Tracking People by Predicting 3D Appearance, Location & Pose

    Authors: Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik

    Abstract: In this paper, we present an approach for tracking people in monocular videos, by predicting their future 3D representations. To achieve this, we first lift people to 3D from a single frame in a robust way. This lifting includes information about the 3D pose of the person, his or her location in the 3D space, and the 3D appearance. As we track a person, we collect 3D observations over time in a tr…

    Submitted 8 December, 2021; originally announced December 2021.

    Comments: Project Page : https://brjathu.github.io/PHALP/

  8. arXiv:2111.07868  [pdf, other]

    cs.CV

    Tracking People with 3D Representations

    Authors: Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik

    Abstract: We present a novel approach for tracking multiple people in video. Unlike past approaches which employ 2D representations, we focus on using 3D representations of people, located in three-dimensional space. To this end, we develop a method, Human Mesh and Appearance Recovery (HMAR) which in addition to extracting the 3D geometry of the person as a SMPL mesh, also extracts appearance as a texture m…

    Submitted 15 November, 2021; originally announced November 2021.

  9. arXiv:2010.09291  [pdf, other]

    cs.LG cs.AI cs.CV

    Meta-learning the Learning Trends Shared Across Tasks

    Authors: Jathushan Rajasegaran, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: Meta-learning stands for 'learning to learn' such that generalization to new tasks is achieved. Among these methods, Gradient-based meta-learning algorithms are a specific sub-class that excel at quick adaptation to new tasks with limited data. This demonstrates their ability to acquire transferable knowledge, a capability that is central to human learning. However, the existing meta-learning appr…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: Code will be released at https://github.com/brjathu/PAMELA

  10. arXiv:2006.09785  [pdf, other]

    cs.CV

    Self-supervised Knowledge Distillation for Few-shot Learning

    Authors: Jathushan Rajasegaran, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: Real-world contains an overwhelmingly large number of object classes, learning all of which at once is infeasible. Few shot learning is a promising learning paradigm due to its ability to learn out of order distributions quickly with only a few samples. Recent works [7, 41] show that simply learning a good feature embedding can outperform more sophisticated meta-learning and metric learning algori…

    Submitted 4 August, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

  11. arXiv:2006.02231  [pdf, other]

    cs.CR cs.CV cs.NE

    A Multi-modal Neural Embeddings Approach for Detecting Mobile Counterfeit Apps: A Case Study on Google Play Store

    Authors: Naveen Karunanayake, Jathushan Rajasegaran, Ashanie Gunathillake, Suranga Seneviratne, Guillaume Jourjon

    Abstract: Counterfeit apps impersonate existing popular apps in attempts to misguide users to install them for various reasons such as collecting personal information or spreading malware. Many counterfeits can be identified once installed, however even a tech-savvy user may struggle to detect them before installation. To this end, this paper proposes to leverage the recent advances in deep learning methods…

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1804.09882

  12. arXiv:2003.11652  [pdf, other]

    cs.LG cs.CV stat.ML

    iTAML: An Incremental Task-Agnostic Meta-learning Approach

    Authors: Jathushan Rajasegaran, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: Humans can continuously learn new knowledge as their experience grows. In contrast, previous learning in deep neural networks can quickly fade out when they are trained on a new task. In this paper, we hypothesize this problem can be avoided by learning a set of generalized parameters, that are neither specific to old nor new tasks. In this pursuit, we introduce a novel meta-learning approach that…

    Submitted 25 March, 2020; originally announced March 2020.

    Comments: Accepted to CVPR 2020

  13. arXiv:2003.08798  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    Incremental Object Detection via Meta-Learning

    Authors: K J Joseph, Jathushan Rajasegaran, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian

    Abstract: In a real-world setting, object instances from new classes can be continuously encountered by object detectors. When existing object detectors are applied to such scenarios, their performance on old classes deteriorates significantly. A few efforts have been reported to address this limitation, all of which apply variants of knowledge distillation to avoid catastrophic forgetting. We note that alt…

    Submitted 15 December, 2021; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: Published in IEEE Transactions on Pattern Analysis & Machine Intelligence, Nov 2021. Code is available in https://github.com/JosephKJ/iOD

    Journal ref: TPAMI, Nov 2021

  14. arXiv:1911.11800  [pdf, other]

    cs.LG cs.AI stat.ML

    TimeCaps: Capturing Time Series Data With Capsule Networks

    Authors: Hirunima Jayasekara, Vinoj Jayasundara, Mohamed Athif, Jathushan Rajasegaran, Sandaru Jayasekara, Suranga Seneviratne, Ranga Rodrigo

    Abstract: Capsule networks excel in understanding spatial relationships in 2D data for vision related tasks. Even though they are not designed to capture 1D temporal relationships, with TimeCaps we demonstrate that given the ability, capsule networks excel in understanding temporal relationships. To this end, we generate capsules along the temporal and channel dimensions creating two temporal feature detect…

    Submitted 18 June, 2022; v1 submitted 26 November, 2019; originally announced November 2019.

  15. arXiv:1906.01120  [pdf, other]

    cs.CV

    An Adaptive Random Path Selection Approach for Incremental Learning

    Authors: Jathushan Rajasegaran, Munawar Hayat, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Ming-Hsuan Yang

    Abstract: In a conventional supervised learning setting, a machine learning model has access to examples of all object classes that are desired to be recognized during the inference stage. This results in a fixed model that lacks the flexibility to adapt to new learning tasks. In practical settings, learning tasks often arrive in a sequence and the models must continually learn to increment their previously…

    Submitted 24 January, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Extended version of Random Path Selection for Incremental Learning, published at NeurIPS 2019

  16. arXiv:1904.09546  [pdf, other]

    cs.CV

    DeepCaps: Going Deeper with Capsule Networks

    Authors: Jathushan Rajasegaran, Vinoj Jayasundara, Sandaru Jayasekara, Hirunima Jayasekara, Suranga Seneviratne, Ranga Rodrigo

    Abstract: Capsule Network is a promising concept in deep learning, yet its true potential is not fully realized thus far, providing sub-par performance on several key benchmark datasets with complex data. Drawing intuition from the success achieved by Convolutional Neural Networks (CNNs) by going deeper, we introduce DeepCaps, a deep capsule network architecture which uses a novel 3D convolution based dyna…

    Submitted 21 April, 2019; originally announced April 2019.

  17. TextCaps : Handwritten Character Recognition with Very Small Datasets

    Authors: Vinoj Jayasundara, Sandaru Jayasekara, Hirunima Jayasekara, Jathushan Rajasegaran, Suranga Seneviratne, Ranga Rodrigo

    Abstract: Many localized languages struggle to reap the benefits of recent advancements in character recognition systems due to the lack of substantial amount of labeled training data. This is due to the difficulty in generating large amounts of labeled data for such languages and inability of deep learning techniques to properly learn from small number of training samples. We solve this problem by introduc…

    Submitted 17 April, 2019; originally announced April 2019.

    Journal ref: In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV) (pp. 254-262). IEEE 2019

  18. Combined Static and Motion Features for Deep-Networks Based Activity Recognition in Videos

    Authors: Sameera Ramasinghe, Jathushan Rajasegaran, Vinoj Jayasundara, Kanchana Ranasinghe, Ranga Rodrigo, Ajith A. Pasqual

    Abstract: Activity recognition in videos in a deep-learning setting---or otherwise---uses both static and pre-computed motion components. The method of combining the two components, whilst keeping the burden on the deep network less, still remains uninvestigated. Moreover, it is not clear what the level of contribution of individual components is, and how to control the contribution. In this work, we use a…

    Submitted 16 October, 2018; originally announced October 2018.

    Journal ref: IEEE Transactions on Circuits and Systems for Video Technology (2017)

  19. arXiv:1804.09882  [pdf, other]

    cs.CR cs.LG cs.NE

    A Neural Embeddings Approach for Detecting Mobile Counterfeit Apps

    Authors: Jathushan Rajasegaran, Suranga Seneviratne, Guillaume Jourjon

    Abstract: Counterfeit apps impersonate existing popular apps in attempts to misguide users to install them for various reasons such as collecting personal information, spreading malware, or simply to increase their advertisement revenue. Many counterfeits can be identified once installed, however even a tech-savvy user may struggle to detect them before installation as app icons and descriptions can be quit…

    Submitted 25 April, 2018; originally announced April 2018.