Skip to main content

Showing 1–17 of 17 results for author: Bober, M

  1. arXiv:2206.11352  [pdf, ps, other

    cs.CV

    Doubly Reparameterized Importance Weighted Structure Learning for Scene Graph Generation

    Authors: Daqi Liu, Miroslaw Bober, Josef Kittler

    Abstract: As a structured prediction task, scene graph generation, given an input image, aims to explicitly model objects and their relationships by constructing a visually-grounded scene graph. In the current literature, such task is universally solved via a message passing neural network based mean field variational Bayesian methodology. The classical loose evidence lower bound is generally chosen as the… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2205.07017

  2. arXiv:2205.09841  [pdf, other

    cs.CV

    Single-cell Subcellular Protein Localisation Using Novel Ensembles of Diverse Deep Architectures

    Authors: Syed Sameed Husain, Eng-Jon Ong, Dmitry Minskiy, Mikel Bober-Irizar, Amaia Irizar, Miroslaw Bober

    Abstract: Unravelling protein distributions within individual cells is key to understanding their function and state and indispensable to developing new treatments. Here we present the Hybrid subCellular Protein Localiser (HCPL), which learns from weakly labelled data to robustly localise single-cell subcellular protein patterns. It comprises innovative DNN architectures exploiting wavelet filters and learn… ▽ More

    Submitted 16 September, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

  3. arXiv:2205.07017  [pdf, other

    cs.CV

    Importance Weighted Structure Learning for Scene Graph Generation

    Authors: Daqi Liu, Miroslaw Bober, Josef Kittler

    Abstract: Scene graph generation is a structured prediction task aiming to explicitly model objects and their relationships via constructing a visually-grounded scene graph for an input image. Currently, the message passing neural network based mean field variational Bayesian methodology is the ubiquitous solution for such a task, in which the variational inference objective is often assumed to be the class… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

  4. arXiv:2203.15392  [pdf, other

    cs.CV

    Efficient Hybrid Network: Inducting Scattering Features

    Authors: Dmitry Minskiy, Miroslaw Bober

    Abstract: Recent work showed that hybrid networks, which combine predefined and learnt filters within a single architecture, are more amenable to theoretical analysis and less prone to overfitting in data-limited scenarios. However, their performance has yet to prove competitive against the conventional counterparts when sufficient amounts of training data are available. In an attempt to address this core l… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted to ICPR-2022

  5. arXiv:2201.11697  [pdf, other

    cs.CV cs.AI

    Constrained Structure Learning for Scene Graph Generation

    Authors: Daqi Liu, Miroslaw Bober, Josef Kittler

    Abstract: As a structured prediction task, scene graph generation aims to build a visually-grounded scene graph to explicitly model objects and their relationships in an input image. Currently, the mean field variational Bayesian framework is the de facto methodology used by the existing methods, in which the unconstrained inference step is often implemented by a message passing neural network. However, suc… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

  6. arXiv:2112.05727  [pdf, other

    cs.CV

    Neural Belief Propagation for Scene Graph Generation

    Authors: Daqi Liu, Miroslaw Bober, Josef Kittler

    Abstract: Scene graph generation aims to interpret an input image by explicitly modelling the potential objects and their relationships, which is predominantly solved by the message passing neural network models in previous methods. Currently, such approximation models generally assume the output variables are totally independent and thus ignore the informative structural higher-order interactions. This cou… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

  7. arXiv:2109.10304  [pdf, other

    cs.LG cs.CV

    Learning PAC-Bayes Priors for Probabilistic Neural Networks

    Authors: Maria Perez-Ortiz, Omar Rivasplata, Benjamin Guedj, Matthew Gleeson, Jingyu Zhang, John Shawe-Taylor, Miroslaw Bober, Josef Kittler

    Abstract: Recent works have investigated deep learning models trained by optimising PAC-Bayes bounds, with priors that are learnt on subsets of the data. This combination has been shown to lead not only to accurate classifiers, but also to remarkably tight risk certificates, bearing promise towards self-certified learning (i.e. use all the data to learn a predictor and certify its quality). In this work, we… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

  8. arXiv:2107.04458  [pdf, other

    cs.LG cs.CV

    Understanding the Distributions of Aggregation Layers in Deep Neural Networks

    Authors: Eng-Jon Ong, Sameed Husain, Miroslaw Bober

    Abstract: The process of aggregation is ubiquitous in almost all deep nets models. It functions as an important mechanism for consolidating deep features into a more compact representation, whilst increasing robustness to overfitting and providing spatial invariance in deep nets. In particular, the proximity of global aggregation layers to the output layers of DNNs mean that aggregated features have a direc… ▽ More

    Submitted 9 July, 2021; originally announced July 2021.

  9. arXiv:1907.05794  [pdf, other

    cs.CV

    ACTNET: end-to-end learning of feature activations and multi-stream aggregation for effective instance image retrieval

    Authors: Syed Sameed Husain, Eng-Jon Ong, Miroslaw Bober

    Abstract: We propose a novel CNN architecture called ACTNET for robust instance image retrieval from large-scale datasets. Our key innovation is a learnable activation layer designed to improve the signal-to-noise ratio (SNR) of deep convolutional feature maps. Further, we introduce a controlled multi-stream aggregation, where complementary deep features from different convolutional layers are optimally tra… ▽ More

    Submitted 23 October, 2020; v1 submitted 12 July, 2019; originally announced July 2019.

  10. REMAP: Multi-layer entropy-guided pooling of dense CNN features for image retrieval

    Authors: Syed Sameed Husain, Miroslaw Bober

    Abstract: This paper addresses the problem of very large-scale image retrieval, focusing on improving its accuracy and robustness. We target enhanced robustness of search to factors such as variations in illumination, object appearance and scale, partial occlusions, and cluttered backgrounds - particularly important when search is performed across very large datasets with significant variability. We propose… ▽ More

    Submitted 15 June, 2019; originally announced June 2019.

    Comments: Submitted to IEEE Trans. Image Processing on 24 May 2018, published 22 May 2019

    Journal ref: IEEE Transactions on Image Processing, Early Access 22 May 2019

  11. arXiv:1905.11387  [pdf, other

    eess.IV cs.CV

    Automatic Delineation of Kidney Region in DCE-MRI

    Authors: Santosh Tirunagari, Norman Poh, Kevin Wells, Miroslaw Bober, Isky Gorden, David Windridge

    Abstract: Delineation of the kidney region in dynamic contrast-enhanced magnetic resonance Imaging (DCE-MRI) is required during post-acquisition analysis in order to quantify various aspects of renal function, such as filtration and perfusion or blood flow. However, this can be obfuscated by the Partial Volume Effect (PVE), caused due to the mixing of any single voxel with two or more signal intensities fro… ▽ More

    Submitted 26 May, 2019; originally announced May 2019.

    Comments: arXiv admin note: text overlap with arXiv:1905.10218

  12. arXiv:1905.10218  [pdf, other

    eess.IV cs.CV

    Functional Segmentation through Dynamic Mode Decomposition: Automatic Quantification of Kidney Function in DCE-MRI Images

    Authors: Santosh Tirunagari, Norman Poh, Kevin Wells, Miroslaw Bober, Isky Gorden, David Windridge

    Abstract: Quantification of kidney function in Dynamic Contrast-Enhanced Magnetic Resonance Imaging (DCE-MRI) requires careful segmentation of the renal region of interest (ROI). Traditionally, human experts are required to manually delineate the kidney ROI across multiple images in the dynamic sequence. This approach is costly, time-consuming and labour intensive, and therefore acts to limit patient throug… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

  13. arXiv:1903.05434  [pdf, ps, other

    cs.CV

    Visual Semantic Information Pursuit: A Survey

    Authors: Daqi Liu, Miroslaw Bober, Josef Kittler

    Abstract: Visual semantic information comprises two important parts: the meaning of each visual semantic unit and the coherent visual semantic relation conveyed by these visual semantic units. Essentially, the former one is a visual perception task while the latter one corresponds to visual context reasoning. Remarkable advances in visual perception have been achieved due to the success of deep learning. In… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: Preliminary work. Under review by IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI). Do not distribute

  14. arXiv:1807.01026  [pdf, other

    cs.CV

    Deep Architectures and Ensembles for Semantic Video Classification

    Authors: Eng-Jon Ong, Sameed Husain, Mikel Bober-Irizar, Miroslaw Bober

    Abstract: This work addresses the problem of accurate semantic labelling of short videos. To this end, a multitude of different deep nets, ranging from traditional recurrent neural networks (LSTM, GRU), temporal agnostic networks (FV,VLAD,BoW), fully connected neural networks mid-stage AV fusion and others. Additionally, we also propose a residual architecture-based DNN for video classification, with state-… ▽ More

    Submitted 7 October, 2018; v1 submitted 3 July, 2018; originally announced July 2018.

  15. arXiv:1707.04272  [pdf, other

    cs.CV

    Cultivating DNN Diversity for Large Scale Video Labelling

    Authors: Mikel Bober-Irizar, Sameed Husain, Eng-Jon Ong, Miroslaw Bober

    Abstract: We investigate factors controlling DNN diversity in the context of the Google Cloud and YouTube-8M Video Understanding Challenge. While it is well-known that ensemble methods improve prediction performance, and that combining accurate but diverse predictors helps, there is little knowledge on how to best promote & measure DNN diversity. We show that diversity can be cultivated by some unexpected m… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.

    Comments: CVPR 2017 Youtube-8M Workshop

  16. arXiv:1702.00338  [pdf, other

    cs.CV

    Siamese Network of Deep Fisher-Vector Descriptors for Image Retrieval

    Authors: Eng-Jon Ong, Sameed Husain, Miroslaw Bober

    Abstract: This paper addresses the problem of large scale image retrieval, with the aim of accurately ranking the similarity of a large number of images to a given query image. To achieve this, we propose a novel Siamese network. This network consists of two computational strands, each comprising of a CNN component followed by a Fisher vector component. The CNN component produces dense, deep convolutional d… ▽ More

    Submitted 1 February, 2017; originally announced February 2017.

  17. arXiv:1607.06783  [pdf

    cs.CV

    Can DMD obtain a Scene Background in Color?

    Authors: Santosh Tirunagari, Norman Poh, Miroslaw Bober, David Windridge

    Abstract: A background model describes a scene without any foreground objects and has a number of applications, ranging from video surveillance to computational photography. Recent studies have introduced the method of Dynamic Mode Decomposition (DMD) for robustly separating video frames into a background model and foreground components. While the method introduced operates by converting color images to gra… ▽ More

    Submitted 22 July, 2016; originally announced July 2016.

    Comments: International Conference on Image, Vision and Computing (ICIVC 2016), August 3-5, 2016, Portsmouth, UK