Skip to main content

Showing 1–16 of 16 results for author: Gharaee, Z

  1. arXiv:2406.12723  [pdf, other

    cs.LG

    BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity

    Authors: Zahra Gharaee, Scott C. Lowe, ZeMing Gong, Pablo Millan Arias, Nicholas Pellegrino, Austin T. Wang, Joakim Bruslund Haurum, Iuliia Zarubiieva, Lila Kari, Dirk Steinke, Graham W. Taylor, Paul Fieguth, Angel X. Chang

    Abstract: As part of an ongoing worldwide effort to comprehend and monitor insect biodiversity, this paper presents the BIOSCAN-5M Insect dataset to the machine learning community and establish several benchmark tasks. BIOSCAN-5M is a comprehensive dataset containing multi-modal information for over 5 million insect specimens, and it significantly expands existing image-based biological datasets by includin… ▽ More

    Submitted 24 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  2. Video Relationship Detection Using Mixture of Experts

    Authors: Ala Shaabana, Zahra Gharaee, Paul Fieguth

    Abstract: Machine comprehension of visual information from images and videos by neural networks faces two primary challenges. Firstly, there exists a computational and inference gap in connecting vision and language, making it difficult to accurately determine which object a given agent acts on and represent it through language. Secondly, classifiers trained by a single, monolithic neural network often lack… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  3. arXiv:2402.11124  [pdf, other

    cs.LG

    Implicit Causal Representation Learning via Switchable Mechanisms

    Authors: Shayan Shirahmad Gale Bagi, Zahra Gharaee, Oliver Schulte, Mark Crowley

    Abstract: Learning causal representations from observational and interventional data in the absence of known ground-truth graph structures necessitates implicit latent causal representation learning. Implicit learning of causal mechanisms typically involves two categories of interventional data: hard and soft interventions. In real-world scenarios, soft interventions are often more realistic than hard inter… ▽ More

    Submitted 28 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  4. arXiv:2309.15274  [pdf, other

    cs.CV cs.AI

    Memory-Efficient Continual Learning Object Segmentation for Long Video

    Authors: Amir Nazemi, Mohammad Javad Shafiee, Zahra Gharaee, Paul Fieguth

    Abstract: Recent state-of-the-art semi-supervised Video Object Segmentation (VOS) methods have shown significant improvements in target object segmentation accuracy when information from preceding frames is used in segmenting the current frame. In particular, such memory-based approaches can help a model to more effectively handle appearance changes (representation drift) or occlusions. Ideally, for maximum… ▽ More

    Submitted 14 February, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  5. arXiv:2307.10455  [pdf, other

    cs.CV cs.AI cs.LG

    A Step Towards Worldwide Biodiversity Assessment: The BIOSCAN-1M Insect Dataset

    Authors: Zahra Gharaee, ZeMing Gong, Nicholas Pellegrino, Iuliia Zarubiieva, Joakim Bruslund Haurum, Scott C. Lowe, Jaclyn T. A. McKeown, Chris C. Y. Ho, Joschka McLeod, Yi-Yun C Wei, Jireh Agda, Sujeevan Ratnasingham, Dirk Steinke, Angel X. Chang, Graham W. Taylor, Paul Fieguth

    Abstract: In an effort to catalog insect biodiversity, we propose a new large dataset of hand-labelled insect images, the BIOSCAN-Insect Dataset. Each record is taxonomically classified by an expert, and also has associated genetic information including raw nucleotide barcode sequences and assigned barcode index numbers, which are genetically-based proxies for species classification. This paper presents a c… ▽ More

    Submitted 13 November, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  6. arXiv:2302.08635  [pdf, other

    cs.LG stat.ML

    Generative Causal Representation Learning for Out-of-Distribution Motion Forecasting

    Authors: Shayan Shirahmad Gale Bagi, Zahra Gharaee, Oliver Schulte, Mark Crowley

    Abstract: Conventional supervised learning methods typically assume i.i.d samples and are found to be sensitive to out-of-distribution (OOD) data. We propose Generative Causal Representation Learning (GCRL) which leverages causality to facilitate knowledge transfer under distribution shifts. While we evaluate the effectiveness of our proposed method in human trajectory prediction models, GCRL can be applied… ▽ More

    Submitted 25 April, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

  7. arXiv:2302.07360  [pdf, other

    cs.CV cs.AI cs.LG

    Self-supervised learning of object pose estimation using keypoint prediction

    Authors: Zahra Gharaee, Felix Järemo Lawin, Per-Erik Forssén

    Abstract: This paper describes recent developments in object specific pose and shape prediction from single images. The main contribution is a new approach to camera pose prediction by self-supervised learning of keypoints corresponding to locations on a category specific deformable shape. We designed a network to generate a proxy ground-truth heatmap from a set of keypoints distributed all over the categor… ▽ More

    Submitted 19 February, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 21 pages, 9 figures, 2 tables

  8. arXiv:2211.02537  [pdf, other

    cs.CV q-bio.PE

    Machine Learning Challenges of Biological Factors in Insect Image Data

    Authors: Nicholas Pellegrino, Zahra Gharaee, Paul Fieguth

    Abstract: The BIOSCAN project, led by the International Barcode of Life Consortium, seeks to study changes in biodiversity on a global scale. One component of the project is focused on studying the species interaction and dynamics of all insects. In addition to genetically barcoding insects, over 1.5 million images per year will be collected, each needing taxonomic classification. With the immense volume of… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 4 pages, 3 figures. Submitted to the Journal of Computational Vision and Imaging Systems

    ACM Class: I.4.0; E.0; J.3

  9. arXiv:2202.04466  [pdf, other

    cs.AI cs.CV cs.HC cs.LG

    Predicting the intended action using internal simulation of perception

    Authors: Zahra Gharaee

    Abstract: This article proposes an architecture, which allows the prediction of intention by internally simulating perceptual states represented by action pattern vectors. To this end, associative self-organising neural networks (A-SOM) is utilised to build a hierarchical cognitive architecture for recognition and simulation of the skeleton based human actions. The abilities of the proposed architecture in… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

  10. Graph Representation Learning for Road Type Classification

    Authors: Zahra Gharaee, Shreyas Kowshik, Oliver Stromann, Michael Felsberg

    Abstract: We present a novel learning-based approach to graph representations of road networks employing state-of-the-art graph convolutional neural networks. Our approach is applied to realistic road networks of 17 cities from Open Street Map. While edge features are crucial to generate descriptive graph representations of road networks, graph convolutional networks usually rely on node features only. We s… ▽ More

    Submitted 3 June, 2022; v1 submitted 16 July, 2021; originally announced July 2021.

  11. arXiv:2104.14870  [pdf, other

    cs.CV cs.AI cs.HC cs.LG cs.RO

    Action in Mind: A Neural Network Approach to Action Recognition and Segmentation

    Authors: Zahra Gharaee

    Abstract: Recognizing and categorizing human actions is an important task with applications in various fields such as human-robot interaction, video analysis, surveillance, video retrieval, health care system and entertainment industry. This thesis presents a novel computational approach for human action recognition through different implementations of multi-layer architectures based on artificial neural ne… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: Lund University Cognitive Science 2018

  12. arXiv:2104.11637  [pdf, other

    cs.CV cs.AI cs.HC cs.LG cs.RO

    Online recognition of unsegmented actions with hierarchical SOM architecture

    Authors: Zahra Gharaee

    Abstract: Automatic recognition of an online series of unsegmented actions requires a method for segmentation that determines when an action starts and when it ends. In this paper, a novel approach for recognizing unsegmented actions in online test experiments is proposed. The method uses self-organizing neural networks to build a three-layer cognitive architecture. The unique features of an action sequence… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

    Journal ref: Cogn Process 22, 77-91 (2021)

  13. arXiv:2104.11165  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Hierarchical growing grid networks for skeleton based action recognition

    Authors: Zahra Gharaee

    Abstract: In this paper, a novel cognitive architecture for action recognition is developed by applying layers of growing grid neural networks.Using these layers makes the system capable of automatically arranging its representational structure. In addition to the expansion of the neural map during the growth phase, the system is provided with a prior knowledge of the input space, which increases the proces… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

    Journal ref: Cognitive Systems Research, vol.63, pp.11-29 (2020)

  14. arXiv:2104.06070  [pdf, other

    cs.RO cs.CV cs.HC cs.LG

    Online Recognition of Actions Involving Objects

    Authors: Zahra Gharaee, Peter Gärdenfors, Magnus Johnsson

    Abstract: We present an online system for real time recognition of actions involving objects working in online mode. The system merges two streams of information processing running in parallel. One is carried out by a hierarchical self-organizing map (SOM) system that recognizes the performed actions by analysing the spatial trajectories of the agent's movements. It consists of two layers of SOMs and a cust… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

  15. First and Second Order Dynamics in a Hierarchical SOM system for Action Recognition

    Authors: Zahra Gharaee, Peter Gärdenfors, Magnus Johnsson

    Abstract: Human recognition of the actions of other humans is very efficient and is based on patterns of movements. Our theoretical starting point is that the dynamics of the joint movements is important to action categorization. On the basis of this theory, we present a novel action recognition system that employs a hierarchy of Self-Organizing Maps together with a custom supervised neural network that lea… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

  16. arXiv:2104.03807  [pdf, other

    cs.CV cs.AI

    A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control

    Authors: Zahra Gharaee, Karl Holmquist, Linbo He, Michael Felsberg

    Abstract: In this paper, we present a state-of-the-art reinforcement learning method for autonomous driving. Our approach employs temporal difference learning in a Bayesian framework to learn vehicle control signals from sensor data. The agent has access to images from a forward facing camera, which are preprocessed to generate semantic segmentation maps. We trained our system using both ground truth and es… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.