Skip to main content

Showing 1–42 of 42 results for author: Leordeanu, M

  1. arXiv:2406.18266  [pdf, other

    cs.CL

    "Vorbeşti Româneşte?" A Recipe to Train Powerful Romanian LLMs with English Instructions

    Authors: Mihai Masala, Denis C. Ilie-Ablachim, Alexandru Dima, Dragos Corlatescu, Miruna Zavelca, Ovio Olaru, Simina Terian, Andrei Terian, Marius Leordeanu, Horia Velicu, Marius Popescu, Mihai Dascalu, Traian Rebedea

    Abstract: In recent years, Large Language Models (LLMs) have achieved almost human-like performance on various tasks. While some LLMs have been trained on multilingual data, most of the training data is in English; hence, their performance in English greatly exceeds other languages. To our knowledge, we are the first to collect and translate a large collection of texts, instructions, and benchmarks and trai… ▽ More

    Submitted 27 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2405.07703

  2. arXiv:2405.07703  [pdf, other

    cs.CL

    OpenLLM-Ro -- Technical Report on Open-source Romanian LLMs

    Authors: Mihai Masala, Denis C. Ilie-Ablachim, Dragos Corlatescu, Miruna Zavelca, Marius Leordeanu, Horia Velicu, Marius Popescu, Mihai Dascalu, Traian Rebedea

    Abstract: In recent years, Large Language Models (LLMs) have achieved almost human-like performance on various tasks. While some LLMs have been trained on multilingual data, most of the training data is in English. Hence, their performance in English greatly exceeds their performance in other languages. This document presents our approach to training and evaluating the first foundational and chat LLM specia… ▽ More

    Submitted 17 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  3. arXiv:2402.08035  [pdf, other

    cs.CV

    Multiple Random Masking Autoencoder Ensembles for Robust Multimodal Semi-supervised Learning

    Authors: Alexandru-Raul Todoran, Marius Leordeanu

    Abstract: There is an increasing number of real-world problems in computer vision and machine learning requiring to take into consideration multiple interpretation layers (modalities or views) of the world and learn how they relate to each other. For example, in the case of Earth Observations from satellite data, it is important to be able to predict one observation layer (e.g. vegetation index) from other… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 17 pages, 11 figures

  4. arXiv:2402.06385  [pdf, other

    cs.CV

    Maia: A Real-time Non-Verbal Chat for Human-AI Interaction

    Authors: Dragos Costea, Alina Marcu, Cristina Lazar, Marius Leordeanu

    Abstract: Face-to-face communication modeling in computer vision is an area of research focusing on developing algorithms that can recognize and analyze non-verbal cues and behaviors during face-to-face interactions. We propose an alternative to text chats for Human-AI interaction, based on non-verbal visual communication only, using facial expressions and head movements that mirror, but also improvise over… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 5 pages, 3 figures

  5. arXiv:2309.08612  [pdf, other

    cs.AI cs.CL cs.CV

    Explaining Vision and Language through Graphs of Events in Space and Time

    Authors: Mihai Masala, Nicolae Cudlenco, Traian Rebedea, Marius Leordeanu

    Abstract: Artificial Intelligence makes great advances today and starts to bridge the gap between vision and language. However, we are still far from understanding, explaining and controlling explicitly the visual content from a linguistic perspective, because we still lack a common explainable representation between the two domains. In this work we come to address this limitation and propose the Graph of E… ▽ More

    Submitted 29 August, 2023; originally announced September 2023.

    Comments: Accepted at IEEE International Conference on Computer Vision (ICCV) 2023 Workshops: 5th Workshop On Closing The Loop Between Vision And Language

  6. arXiv:2308.11021  [pdf, other

    cs.CV cs.LG

    Multi-Task Hypergraphs for Semi-supervised Learning using Earth Observations

    Authors: Mihai Pirvu, Alina Marcu, Alexandra Dobrescu, Nabil Belbachir, Marius Leordeanu

    Abstract: There are many ways of interpreting the world and they are highly interdependent. We exploit such complex dependencies and introduce a powerful multi-task hypergraph, in which every node is a task and different paths through the hypergraph reaching a given task become unsupervised teachers, by forming ensembles that learn to generate reliable pseudolabels for that task. Each hyperedge is part of a… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted in ICCV 2023 Workshops

  7. arXiv:2308.07615  [pdf, other

    cs.CV

    Self-supervised Hypergraphs for Learning Multiple World Interpretations

    Authors: Alina Marcu, Mihai Pirvu, Dragos Costea, Emanuela Haller, Emil Slusanschi, Ahmed Nabil Belbachir, Rahul Sukthankar, Marius Leordeanu

    Abstract: We present a method for learning multiple scene representations given a small labeled set, by exploiting the relationships between such representations in the form of a multi-task hypergraph. We also show how we can use the hypergraph to improve a powerful pretrained VisTransformer model without any additional labeled data. In our hypergraph, each node is an interpretation layer (e.g., depth or se… ▽ More

    Submitted 21 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Accepted in ICCV 2023 Workshops

  8. arXiv:2308.04934  [pdf, other

    cs.CV cs.LG

    JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition

    Authors: Lucian Bicsi, Bogdan Alexe, Radu Tudor Ionescu, Marius Leordeanu

    Abstract: We propose JEDI, a multi-dataset semi-supervised learning method, which efficiently combines knowledge from multiple experts, learned on different datasets, to train and improve the performance of individual, per dataset, student models. Our approach achieves this by addressing two important problems in current machine learning research: generalization across datasets and limitations of supervised… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted in ICCV 2023 Workshops

  9. arXiv:2306.14709  [pdf, other

    cs.CV cs.LG cs.RO

    Self-supervised novel 2D view synthesis of large-scale scenes with efficient multi-scale voxel carving

    Authors: Alexandra Budisteanu, Dragos Costea, Alina Marcu, Marius Leordeanu

    Abstract: The task of generating novel views of real scenes is increasingly important nowadays when AI models become able to create realistic new worlds. In many practical applications, it is important for novel view synthesis methods to stay grounded in the physical world as much as possible, while also being able to imagine it from previously unseen views. While most current methods are developed and test… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 11 pages, 3 figures

  10. arXiv:2305.12940  [pdf, other

    cs.CL

    GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language

    Authors: Mihai Masala, Nicolae Cudlenco, Traian Rebedea, Marius Leordeanu

    Abstract: One of the essential human skills is the ability to seamlessly build an inner representation of the world. By exploiting this representation, humans are capable of easily finding consensus between visual, auditory and linguistic perspectives. In this work, we set out to understand and emulate this ability through an explicit representation for both vision and language - Graphs of Events in Space a… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  11. Anatomically aware dual-hop learning for pulmonary embolism detection in CT pulmonary angiograms

    Authors: Florin Condrea, Saikiran Rapaka, Lucian Itu, Puneet Sharma, Jonathan Sperl, A Mohamed Ali, Marius Leordeanu

    Abstract: Pulmonary Embolisms (PE) represent a leading cause of cardiovascular death. While medical imaging, through computed tomographic pulmonary angiography (CTPA), represents the gold standard for PE diagnosis, it is still susceptible to misdiagnosis or significant diagnosis delays, which may be fatal for critical cases. Despite the recently demonstrated power of deep learning to bring a significant boo… ▽ More

    Submitted 17 May, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted to Computers in Biology and Medicine journal

  12. arXiv:2212.08058  [pdf, other

    cs.CV

    Learning a Fast 3D Spectral Approach to Object Segmentation and Tracking over Space and Time

    Authors: Elena Burceanu, Marius Leordeanu

    Abstract: We pose video object segmentation as spectral graph clustering in space and time, with one graph node for each pixel and edges forming local space-time neighborhoods. We claim that the strongest cluster in this video graph represents the salient object. We start by introducing a novel and efficient method based on 3D filtering for approximating the spectral solution, as the principal eigenvector o… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  13. arXiv:2104.08271  [pdf, other

    cs.CV

    TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval

    Authors: Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu, Hailin Jin, Andrew Zisserman, Samuel Albanie, Yang Liu

    Abstract: In recent years, considerable progress on the task of text-video retrieval has been achieved by leveraging large-scale pretraining on visual and audio datasets to construct powerful video encoders. By contrast, despite the natural symmetry, the design of effective algorithms for exploiting large-scale language pretraining remains under-explored. In this work, we are the first to investigate the de… ▽ More

    Submitted 26 September, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: ICCV 2021

  14. arXiv:2103.14417  [pdf, other

    cs.LG cs.CV

    Self-Supervised Learning in Multi-Task Graphs through Iterative Consensus Shift

    Authors: Emanuela Haller, Elena Burceanu, Marius Leordeanu

    Abstract: The human ability to synchronize the feedback from all their senses inspired recent works in multi-task and multi-modal learning. While these works rely on expensive supervision, our multi-task graph requires only pseudo-labels from expert models. Every graph node represents a task, and each edge learns between tasks transformations. Once initialized, the graph learns self-supervised, based on a n… ▽ More

    Submitted 4 November, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted at The British Machine Vision Conference (BMVC) 2021, 12 pages, 6 figures, 5 tables

  15. arXiv:2012.07123  [pdf, other

    cs.CV

    Iterative Knowledge Exchange Between Deep Learning and Space-Time Spectral Clustering for Unsupervised Segmentation in Videos

    Authors: Emanuela Haller, Adina Magda Florea, Marius Leordeanu

    Abstract: We propose a dual system for unsupervised object segmentation in video, which brings together two modules with complementary properties: a space-time graph that discovers objects in videos and a deep network that learns powerful object features. The system uses an iterative knowledge exchange policy. A novel spectral space-time clustering process on the graph produces unsupervised segmentation mas… ▽ More

    Submitted 13 December, 2020; originally announced December 2020.

  16. arXiv:2010.01910  [pdf, other

    cs.CV

    Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation

    Authors: Alina Marcu, Vlad Licaret, Dragos Costea, Marius Leordeanu

    Abstract: Semantic segmentation is a crucial task for robot navigation and safety. However, current supervised methods require a large amount of pixelwise annotations to yield accurate results. Labeling is a tedious and time consuming process that has hampered progress in low altitude UAV applications. This paper makes an important step towards automatic annotation by introducing SegProp, a novel iterative… ▽ More

    Submitted 2 October, 2020; originally announced October 2020.

    Comments: Accepted as oral presentation at Asian Conference on Computer Vision (ACCV), 2020. arXiv admin note: text overlap with arXiv:1910.10026

  17. arXiv:2010.01086  [pdf, other

    cs.CV

    Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus

    Authors: Marius Leordeanu, Mihai Pirvu, Dragos Costea, Alina Marcu, Emil Slusanschi, Rahul Sukthankar

    Abstract: We address the challenging problem of semi-supervised learning in the context of multiple visual interpretations of the world by finding consensus in a graph of neural networks. Each graph node is a scene interpretation layer, while each edge is a deep net that transforms one layer at one node into another from a different node. During the supervised phase edge networks are trained independently.… ▽ More

    Submitted 3 December, 2020; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Accepted at the 35th AAAI Conference on Artificial Intelligence (AAAI 2021)

  18. arXiv:2009.08427  [pdf, other

    cs.CV cs.LG

    Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural Networks

    Authors: Iulia Duta, Andrei Nicolicioiu, Marius Leordeanu

    Abstract: Graph Neural Networks are perfectly suited to capture latent interactions between various entities in the spatio-temporal domain (e.g. videos). However, when an explicit structure is not available, it is not obvious what atomic elements should be represented as nodes. Current works generally use pre-trained object detectors or fixed, predefined regions to extract graph nodes. Improving upon this,… ▽ More

    Submitted 7 December, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2021)

  19. arXiv:2006.12926  [pdf, other

    q-bio.PE cs.LG

    A self-supervised neural-analytic method to predict the evolution of COVID-19 in Romania

    Authors: Radu D. Stochiţoiu, Marian Petrica, Traian Rebedea, Ionel Popescu, Marius Leordeanu

    Abstract: Analysing and understanding the transmission and evolution of the COVID-19 pandemic is mandatory to be able to design the best social and medical policies, foresee their outcomes and deal with all the subsequent socio-economic effects. We address this important problem from a computational and machine learning perspective. More specifically, we want to statistically estimate all the relevant param… ▽ More

    Submitted 5 September, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: update author list

  20. arXiv:2004.07691  [pdf, other

    cs.CV cs.LG eess.IV

    In Search of Life: Learning from Synthetic Data to Detect Vital Signs in Videos

    Authors: Florin Condrea, Victor-Andrei Ivan, Marius Leordeanu

    Abstract: Automatically detecting vital signs in videos, such as the estimation of heart and respiration rates, is a challenging research problem in computer vision with important applications in the medical field. One of the key difficulties in tackling this task is the lack of sufficient supervised training data, which severely limits the use of powerful deep neural networks. In this paper we address this… ▽ More

    Submitted 23 April, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Computer Vision and Pattern Recognition (CVPR) Workshop on Computer Vision for Physiological Measurement (CVPM) 2020

  21. arXiv:1910.10026  [pdf, other

    cs.CV

    Towards Automatic Annotation for Semantic Segmentation in Drone Videos

    Authors: Alina Marcu, Dragos Costea, Vlad Licaret, Marius Leordeanu

    Abstract: Semantic segmentation is a crucial task for robot navigation and safety. However, it requires huge amounts of pixelwise annotations to yield accurate results. While recent progress in computer vision algorithms has been heavily boosted by large ground-level datasets, the labeling time has hampered progress in low altitude UAV applications, mostly due to the difficulty imposed by large object scale… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: 7 pages, 6 figures, submitted at the International Conference on Robotics and Automation (ICRA) 2020

  22. arXiv:1910.08967  [pdf, other

    cs.LG cs.CV stat.ML

    Image Difficulty Curriculum for Generative Adversarial Networks (CuGAN)

    Authors: Petru Soviany, Claudiu Ardei, Radu Tudor Ionescu, Marius Leordeanu

    Abstract: Despite the significant advances in recent years, Generative Adversarial Networks (GANs) are still notoriously hard to train. In this paper, we propose three novel curriculum learning strategies for training GANs. All strategies are first based on ranking the training images by their difficulty scores, which are estimated by a state-of-the-art image difficulty predictor. Our first strategy is to d… ▽ More

    Submitted 22 October, 2019; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: Accepted at WACV 2020

  23. arXiv:1910.02818  [pdf, other

    cs.CV cs.LG cs.RO

    Learning Navigation by Visual Localization and Trajectory Prediction

    Authors: Iulia Paraicu, Marius Leordeanu

    Abstract: When driving, people make decisions based on current traffic as well as their desired route. They have a mental map of known routes and are often able to navigate without needing directions. Current self-driving models improve their performances when using additional GPS information. Here we aim to push forward self-driving research and perform route planning even in the absence of GPS. Our system… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: Submitted to ICRA 2020

  24. arXiv:1907.03326  [pdf, other

    cs.CV

    Spacetime Graph Optimization for Video Object Segmentation

    Authors: Emanuela Haller, Adina Magda Florea, Marius Leordeanu

    Abstract: We address the challenging task of foreground object discovery and segmentation in video. We introduce an efficient solution, suitable for both unsupervised and supervised scenarios, based on a spacetime graph representation of the video sequence. We ensure a fine grained representation with one-to-one correspondences between graph nodes and video pixels. We formulate the task as a spectral cluste… ▽ More

    Submitted 3 August, 2019; v1 submitted 7 July, 2019; originally announced July 2019.

  25. arXiv:1907.02731  [pdf, other

    cs.CV

    A 3D Convolutional Approach to Spectral Object Segmentation in Space and Time

    Authors: Elena Burceanu, Marius Leordeanu

    Abstract: We formulate object segmentation in video as a graph partitioning problem in space and time, in which nodes are pixels and their relations form local neighborhoods. We claim that the strongest cluster in this pixel-level graph represents the salient object segmentation. We compute the main cluster using a novel and fast 3D filtering technique that finds the spectral clustering solution, namely the… ▽ More

    Submitted 27 April, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

    Comments: accepted at International Joint Conference on Artificial Intelligence 2020 (IJCAI-2020)

  26. arXiv:1905.09970  [pdf, other

    cs.CV cs.LG

    Shift R-CNN: Deep Monocular 3D Object Detection with Closed-Form Geometric Constraints

    Authors: Andretti Naiden, Vlad Paunescu, Gyeongmo Kim, ByeongMoon Jeon, Marius Leordeanu

    Abstract: We propose Shift R-CNN, a hybrid model for monocular 3D object detection, which combines deep learning with the power of geometry. We adapt a Faster R-CNN network for regressing initial 2D and 3D object properties and combine it with a least squares solution for the inverse 2D to 3D geometric mapping problem, using the camera projection matrix. The closed-form solution of the mathematical system,… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

    Comments: v1: Accepted to be published in 2019 IEEE International Conference on Image Processing, Sep 22-25, 2019, Taipei. IEEE Copyright notice added. Minor changes for camera-ready version. (updated May. 15, 2019)

  27. arXiv:1904.05582  [pdf, other

    cs.CV

    Recurrent Space-time Graph Neural Networks

    Authors: Andrei Nicolicioiu, Iulia Duta, Marius Leordeanu

    Abstract: Learning in the space-time domain remains a very challenging problem in machine learning and computer vision. Current computational models for understanding spatio-temporal visual data are heavily rooted in the classical single-image based paradigm. It is not yet well understood how to integrate information in space and time into a single, general model. We propose a neural graph model, recurrent… ▽ More

    Submitted 23 December, 2019; v1 submitted 11 April, 2019; originally announced April 2019.

    Journal ref: Advances in Neural Information Processing Systems 32 {NeurIPS 2019} pages 12838-1285

  28. Unsupervised learning of foreground object detection

    Authors: Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

    Abstract: Unsupervised learning poses one of the most difficult challenges in computer vision today. The task has an immense practical value with many applications in artificial intelligence and emerging technologies, as large quantities of unlabeled videos can be collected at relatively low cost. In this paper, we address the unsupervised learning problem in the context of detecting the main foreground obj… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

    Comments: International Journal of Computer Vision (IJCV), 2019

  29. arXiv:1806.01954  [pdf, other

    cs.CV

    Mining for meaning: from vision to language through multiple networks consensus

    Authors: Iulia Duta, Andrei Liviu Nicolicioiu, Simion-Vlad Bogolin, Marius Leordeanu

    Abstract: Describing visual data into natural language is a very challenging task, at the intersection of computer vision, natural language processing and machine learning. Language goes well beyond the description of physical objects and their interactions and can convey the same abstract idea in many ways. It is both about content at the highest semantic level as well as about fluent form. Here we propose… ▽ More

    Submitted 18 September, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: Accepted at BMVC 2018

    Journal ref: British Machine Vision Conference 2018, {BMVC} 2018

  30. arXiv:1804.01771  [pdf, other

    cs.CV

    Learning a Robust Society of Tracking Parts using Co-occurrence Constraints

    Authors: Elena Burceanu, Marius Leordeanu

    Abstract: Object tracking is an essential problem in computer vision that has been researched for several decades. One of the main challenges in tracking is to adapt to object appearance changes over time and avoiding drifting to background clutter. We address this challenge by proposing a deep neural network composed of different parts, which functions as a society of tracking parts. They work in conjuncti… ▽ More

    Submitted 8 November, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: 17+3 pages, 5 figures, European Conference on Computer Vision (ECCV), Visual Object Tracking workshop

  31. arXiv:1804.01322  [pdf, other

    cs.CV

    A Multi-Stage Multi-Task Neural Network for Aerial Scene Interpretation and Geolocalization

    Authors: Alina Marcu, Dragos Costea, Emil Slusanschi, Marius Leordeanu

    Abstract: Semantic segmentation and vision-based geolocalization in aerial images are challenging tasks in computer vision. Due to the advent of deep convolutional nets and the availability of relatively low cost UAVs, they are currently generating a growing attention in the field. We propose a novel multi-task multi-stage neural network that is able to handle the two problems at the same time, in a single… ▽ More

    Submitted 4 April, 2018; originally announced April 2018.

    Comments: 23 pages, 11 figures. Under review at the 15th European Conference on Computer Vision (ECCV 2018)

  32. arXiv:1705.09602  [pdf, other

    cs.CV

    Learning a Robust Society of Tracking Parts

    Authors: Elena Burceanu, Marius Leordeanu

    Abstract: Object tracking is an essential task in computer vision that has been studied since the early days of the field. Being able to follow objects that undergo different transformations in the video sequence, including changes in scale, illumination, shape and occlusions, makes the problem extremely difficult. One of the real challenges is to keep track of the changes in objects appearance and not drif… ▽ More

    Submitted 26 May, 2017; originally announced May 2017.

    Comments: 9.5 pages of main content, 2.5 of bibliography, 2 pages of appendix, 3 figures

  33. arXiv:1705.08280  [pdf, other

    cs.CV

    How hard can it be? Estimating the difficulty of visual search in an image

    Authors: Radu Tudor Ionescu, Bogdan Alexe, Marius Leordeanu, Marius Popescu, Dim P. Papadopoulos, Vittorio Ferrari

    Abstract: We address the problem of estimating image difficulty defined as the human response time for solving a visual search task. We collect human annotations of image difficulty for the PASCAL VOC 2012 data set through a crowd-sourcing platform. We then analyze what human interpretable image properties can have an impact on visual search difficulty, and how accurate are those properties for predicting d… ▽ More

    Submitted 23 May, 2017; originally announced May 2017.

    Comments: Published at CVPR 2016

    Journal ref: In Proceedings of CVPR, pp. 2157-2166, 2016

  34. arXiv:1704.05674  [pdf, other

    cs.CV

    Unsupervised object segmentation in video by efficient selection of highly probable positive features

    Authors: Emanuela Haller, Marius Leordeanu

    Abstract: We address an essential problem in computer vision, that of unsupervised object segmentation in video, where a main object of interest in a video sequence should be automatically separated from its background. An efficient solution to this task would enable large-scale video interpretation at a high semantic level in the absence of the costly manually labeled ground truth. We propose an efficient… ▽ More

    Submitted 19 April, 2017; originally announced April 2017.

  35. arXiv:1703.10901  [pdf, other

    cs.CV

    Unsupervised learning from video to detect foreground objects in single images

    Authors: Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

    Abstract: Unsupervised learning from visual data is one of the most difficult challenges in computer vision, being a fundamental task for understanding how visual recognition works. From a practical point of view, learning from unsupervised visual input has an immense practical value, as very large quantities of unlabeled videos can be collected at low cost. In this paper, we address the task of unsupervise… ▽ More

    Submitted 31 March, 2017; originally announced March 2017.

  36. arXiv:1605.08323  [pdf, other

    cs.CV

    Aerial image geolocalization from recognition and matching of roads and intersections

    Authors: Dragos Costea, Marius Leordeanu

    Abstract: Aerial image analysis at a semantic level is important in many applications with strong potential impact in industry and consumer use, such as automated mapping, urban planning, real estate and environment monitoring, or disaster relief. The problem is enjoying a great interest in computer vision and remote sensing, due to increased computer power and improvement in automated image understanding a… ▽ More

    Submitted 26 May, 2016; originally announced May 2016.

  37. arXiv:1605.05462  [pdf, other

    cs.CV

    Dual Local-Global Contextual Pathways for Recognition in Aerial Imagery

    Authors: Alina Marcu, Marius Leordeanu

    Abstract: Visual context is important in object recognition and it is still an open problem in computer vision. Along with the advent of deep convolutional neural networks (CNN), using contextual information with such systems starts to receive attention in the literature. At the same time, aerial imagery is gaining momentum. While advances in deep learning make good progress in aerial image analysis, this p… ▽ More

    Submitted 18 May, 2016; originally announced May 2016.

  38. arXiv:1512.00517  [pdf, other

    cs.CV

    Labeling the Features Not the Samples: Efficient Video Classification with Minimal Supervision

    Authors: Marius Leordeanu, Alexandra Radu, Shumeet Baluja, Rahul Sukthankar

    Abstract: Feature selection is essential for effective visual recognition. We propose an efficient joint classifier learning and feature selection method that discovers sparse, compact representations of input features from a vast sea of candidates, with an almost unsupervised formulation. Our method requires only the following knowledge, which we call the \emph{feature sign}---whether or not a particular f… ▽ More

    Submitted 1 December, 2015; originally announced December 2015.

    Comments: arXiv admin note: text overlap with arXiv:1411.7714

  39. arXiv:1511.06674  [pdf, other

    cs.CV cs.CL

    Stories in the Eye: Contextual Visual Interactions for Efficient Video to Language Translation

    Authors: Anirudh Goyal, Marius Leordeanu

    Abstract: Integrating higher level visual and linguistic interpretations is at the heart of human intelligence. As automatic visual category recognition in images is approaching human performance, the high level understanding in the dynamic spatiotemporal domain of videos and its translation into natural language is still far from being solved. While most works on vision-to-text translations use pre-learned… ▽ More

    Submitted 20 November, 2015; originally announced November 2015.

  40. arXiv:1411.7714  [pdf, other

    cs.CV

    Features in Concert: Discriminative Feature Selection meets Unsupervised Clustering

    Authors: Marius Leordeanu, Alexandra Radu, Rahul Sukthankar

    Abstract: Feature selection is an essential problem in computer vision, important for category learning and recognition. Along with the rapid development of a wide variety of visual features and classifiers, there is a growing need for efficient feature selection and combination methods, to construct powerful classifiers for more complex and higher-level recognition tasks. We propose an algorithm that effic… ▽ More

    Submitted 27 November, 2014; originally announced November 2014.

  41. arXiv:1404.2903  [pdf, other

    cs.CV cs.LG cs.NE

    Thoughts on a Recursive Classifier Graph: a Multiclass Network for Deep Object Recognition

    Authors: Marius Leordeanu, Rahul Sukthankar

    Abstract: We propose a general multi-class visual recognition model, termed the Classifier Graph, which aims to generalize and integrate ideas from many of today's successful hierarchical recognition approaches. Our graph-based model has the advantage of enabling rich interactions between classes from different levels of interpretation and abstraction. The proposed multi-class system is efficiently learned… ▽ More

    Submitted 2 April, 2014; originally announced April 2014.

  42. arXiv:1202.3684  [pdf, other

    cs.CV

    Generalized Boundaries from Multiple Image Interpretations

    Authors: Marius Leordeanu, Rahul Sukthankar, Cristian Sminchisescu

    Abstract: Boundary detection is essential for a variety of computer vision tasks such as segmentation and recognition. In this paper we propose a unified formulation and a novel algorithm that are applicable to the detection of different types of boundaries, such as intensity edges, occlusion boundaries or object category specific boundaries. Our formulation leads to a simple method with state-of-the-art pe… ▽ More

    Submitted 16 February, 2012; originally announced February 2012.