Showing 1–50 of 62 results for author: Junior, A

  1. arXiv:2406.06512  [pdf, other]

    cs.CV cs.AI

    Merlin: A Vision Language Foundation Model for 3D Computed Tomography

    Authors: Louis Blankemeier, Joseph Paul Cohen, Ashwin Kumar, Dave Van Veen, Syed Jamal Safdar Gardezi, Magdalini Paschali, Zhihong Chen, Jean-Benoit Delbrouck, Eduardo Reis, Cesar Truyts, Christian Bluethgen, Malte Engmann Kjeldskov Jensen, Sophie Ostmeier, Maya Varma, Jeya Maria Jose Valanarasu, Zhongnan Fang, Zepeng Huo, Zaid Nabulsi, Diego Ardila, Wei-Hung Weng, Edson Amaro Junior, Neera Ahuja, Jason Fries, Nigam H. Shah, Andrew Johnston , et al. (6 additional authors not shown)

    Abstract: Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current radiologist shortage, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies. Prior state-of-the-art approaches for automated medical image interpretation leverage vision la… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  2. arXiv:2405.10678  [pdf]

    cs.CY cs.CR

    IT Strategic alignment in the decentralized finance (DeFi): CBDC and digital currencies

    Authors: Carlos Alberto Durigan Junior, Fernando Jose Barbin Laurindo

    Abstract: Cryptocurrency can be understood as a digital asset transacted among participants in the crypto economy. Every cryptocurrency must have an associated Blockchain. Blockchain is a Distributed Ledger Technology (DLT) that supports cryptocurrencies; it may be considered the most promising disruptive technology in the Industry 4.0 context. Decentralized finance (DeFi) is a Blockchain-based financ… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Keywords: IT Strategic alignment, Decentralized Finance (DeFi), Cryptocurrency, Digital Economy

  3. arXiv:2405.05129  [pdf, other]

    cs.SI

    Web Intelligence Journal in perspective: an analysis of its two decades trajectory

    Authors: Diogenes Ademir Domingos, Victor Emanuel Santos Moura, Antonio Fernando Lavareda Jacob Junior, Fabio Manoel Franca Lobato

    Abstract: The evolution of a thematic area undergoes various changes of perspective and adopts new theoretical approaches that arise from the interactions of the community and a wide range of social needs. The advent of digital technologies, such as social networks, underlines this factor by spreading knowledge and forging links between different communities. Web intelligence is now on the verge of raising… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  4. arXiv:2404.01552  [pdf]

    cs.CY

    The use of the open innovation paradigm in the public sector: a systematic review of published studies

    Authors: Joel Alves de Lima Júnior, Kiev Gama, Jorge da Silva Correia Neto

    Abstract: The use of the open innovation paradigm has been, over the past years, getting special attention in the public sector. Motivated by an urban environment that is increasingly more complex and challenging, several government agencies have been allocating financial resources and efforts to promote open and participative government initiatives. As a way to try and understand this scenario, a systemati… ▽ More

    Submitted 8 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 33 pages, 6 figures and 18 tables

  5. arXiv:2404.00399  [pdf, other]

    cs.CL cs.AI cs.LG

    Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

    Authors: Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, Jason T Stillerman, Felix Friedrich, Prateek Yadav, Tanmay Laud, Vu Minh Chien, Terry Yue Zhuo, Diganta Misra, Ben Bogin, Xuan-Son Vu, Marzena Karpinska, Arnav Varma Dantuluri, Wojciech Kusa, Tommaso Furlanello, Rio Yokota, Niklas Muennighoff, Suhas Pai, Tosin Adewumi, Veronika Laippala, Xiaozhe Yao, Adalberto Junior, Alpay Ariyak , et al. (20 additional authors not shown)

    Abstract: Pretrained language models underpin several AI applications, but their high computational cost for training limits accessibility. Initiatives such as BLOOM and StarCoder aim to democratize access to pretrained models for collaborative community development. However, such existing models face challenges: limited multilingual capabilities, continual pretraining causing catastrophic forgetting, where… ▽ More

    Submitted 23 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Preprint

  6. An Incremental MaxSAT-based Model to Learn Interpretable and Balanced Classification Rules

    Authors: Antônio Carlos Souza Ferreira Júnior, Thiago Alves Rocha

    Abstract: The increasing advancements in the field of machine learning have led to the development of numerous applications that effectively address a wide range of problems with accurate predictions. However, in certain cases, accuracy alone may not be sufficient. Many real-world problems also demand explanations and interpretability behind the predictions. One of the most popular interpretable models that… ▽ More

    Submitted 29 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 16 pages, 5 tables, submitted to BRACIS 2023 (Brazilian Conference on Intelligent Systems), accepted version published in Intelligent Systems, LNCS, vol 14195

    ACM Class: I.2.4; I.2.6

    Journal ref: Intelligent Systems (2023), LNCS, vol 14195 (pp. 227-242), Springer Nature

  7. arXiv:2401.02909  [pdf, other]

    cs.CL

    Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task

    Authors: Gabriel Lino Garcia, Pedro Henrique Paiola, Luis Henrique Morelli, Giovani Candido, Arnaldo Cândido Júnior, Danilo Samuel Jodas, Luis C. S. Afonso, Ivan Rizzo Guilherme, Bruno Elias Penteado, João Paulo Papa

    Abstract: Large Language Models (LLMs) are increasingly bringing advances to Natural Language Processing. However, low-resource languages, those lacking extensive prominence in datasets for various NLP tasks, or where existing datasets are not as substantial, such as Portuguese, already obtain several benefits from LLMs, but not to the same extent. LLMs trained on multilingual datasets normally struggle to… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 10 pages, 3 figures

  8. arXiv:2312.08773  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Offshore Wind Plant Instance Segmentation Using Sentinel-1 Time Series, GIS, and Semantic Segmentation Models

    Authors: Osmar Luiz Ferreira de Carvalho, Osmar Abilio de Carvalho Junior, Anesmar Olino de Albuquerque, Daniel Guerreiro e Silva

    Abstract: Offshore wind farms represent a renewable energy source with a significant global growth trend, and their monitoring is strategic for territorial and environmental planning. This study's primary objective is to detect offshore wind plants at an instance level using semantic segmentation models and Sentinel-1 time series. The secondary objectives are: (a) to develop a database consisting of labeled… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 21 pages, 5 figures

    MSC Class: 68T45 ACM Class: I.4.6

  9. arXiv:2311.05051  [pdf, other]

    cs.CL

    Deep Learning Brasil at ABSAPT 2022: Portuguese Transformer Ensemble Approaches

    Authors: Juliana Resplande Santanna Gomes, Eduardo Augusto Santos Garcia, Adalberto Ferreira Barbosa Junior, Ruan Chaves Rodrigues, Diogo Fernandes Costa Silva, Dyonnatan Ferreira Maia, Nádia Félix Felipe da Silva, Arlindo Rodrigues Galvão Filho, Anderson da Silva Soares

    Abstract: Aspect-based Sentiment Analysis (ABSA) is a task whose objective is to classify the individual sentiment polarity of all entities, called aspects, in a sentence. The task is composed of two subtasks: Aspect Term Extraction (ATE), identify all aspect terms in a sentence; and Sentiment Orientation Extraction (SOE), given a sentence and its aspect terms, the task is to determine the sentiment polarit… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 11 pages, 3 figures, In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), Online. CEUR. org

    Report number: urn:nbn:de:0074-3202-9

  10. DeepLearningBrasil@LT-EDI-2023: Exploring Deep Learning Techniques for Detecting Depression in Social Media Text

    Authors: Eduardo Garcia, Juliana Gomes, Adalberto Barbosa Júnior, Cardeque Borges, Nádia da Silva

    Abstract: In this paper, we delineate the strategy employed by our team, DeepLearningBrasil, which secured us the first place in the shared task DepSign-LT-EDI@RANLP-2023, achieving a 47.0% Macro F1-Score and a notable 2.4% advantage. The task was to classify social media texts into three distinct levels of depression - "not depressed," "moderately depressed," and "severely depressed." Leveraging the power… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Report number: 2023.ltedi-1.42

  11. arXiv:2310.16148  [pdf, other]

    cs.CV cs.AI

    Yin Yang Convolutional Nets: Image Manifold Extraction by the Analysis of Opposites

    Authors: Augusto Seben da Rosa, Frederico Santos de Oliveira, Anderson da Silva Soares, Arnaldo Candido Junior

    Abstract: Computer vision in general has seen several advances, such as training optimizations and new architectures (pure attention, efficient block, vision language models, generative models, among others). These have improved performance in several tasks, such as classification. However, the majority of these models focus on modifications that move away from realistic neuroscientific app… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 12 pages, 5 tables and 6 figures

    ACM Class: I.2.10

  12. arXiv:2309.11052  [pdf, other]

    cs.CL cs.LG stat.ML

    fakenewsbr: A Fake News Detection Platform for Brazilian Portuguese

    Authors: Luiz Giordani, Gilsiley Darú, Rhenan Queiroz, Vitor Buzinaro, Davi Keglevich Neiva, Daniel Camilo Fuentes Guzmán, Marcos Jardel Henriques, Oilson Alberto Gonzatto Junior, Francisco Louzada

    Abstract: The proliferation of fake news has become a significant concern in recent times due to its potential to spread misinformation and manipulate public opinion. This paper presents a comprehensive study on detecting fake news in Brazilian Portuguese, focusing on journalistic-type news. We propose a machine learning-based approach that leverages natural language processing techniques, including TF-IDF… ▽ More

    Submitted 20 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

  13. arXiv:2306.13116  [pdf, other]

    cs.LG cs.AI

    A Machine Learning Pressure Emulator for Hydrogen Embrittlement

    Authors: Minh Triet Chau, João Lucas de Sousa Almeida, Elie Alhajjar, Alberto Costa Nogueira Junior

    Abstract: A recent alternative for hydrogen transportation as a mixture with natural gas is blending it into natural gas pipelines. However, hydrogen embrittlement of material is a major concern for scientists and gas installation designers to avoid process failures. In this paper, we propose a physics-informed machine learning model to predict the gas pressure on the pipes' inner wall. Despite its high-fid… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

  14. arXiv:2306.10097  [pdf, other]

    eess.AS cs.AI cs.CL

    CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages

    Authors: Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Anderson S. Soares, Arlindo R. Galvão Filho

    Abstract: In this paper, we present CML-TTS, a recursive acronym for CML-Multi-Lingual-TTS, a new Text-to-Speech (TTS) dataset developed at the Center of Excellence in Artificial Intelligence (CEIA) of the Federal University of Goias (UFG). CML-TTS is based on Multilingual LibriSpeech (MLS) and adapted for training TTS models, consisting of audiobooks in seven languages: Dutch, French, German, Italian, Port… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 12 pages, 5 figures, Accepted at the 25th International Conference on Text, Speech and Dialogue (TSD 2022)

  15. arXiv:2306.09979  [pdf, other]

    cs.SD cs.AI eess.AS

    Evaluation of Speech Representations for MOS prediction

    Authors: Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Lucas R. S. Gris, Anderson S. Soares, Arlindo R. Galvão Filho

    Abstract: In this paper, we evaluate feature extraction models for predicting speech quality. We also propose a model architecture to compare embeddings of supervised learning and self-supervised learning models with embeddings of speaker verification models to predict the metric MOS. Our experiments were performed on the VCC2018 dataset and a Brazilian-Portuguese dataset called BRSpeechMOS, which was creat… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 12 pages, 4 figures, Accepted to the 26th International Conference of Text, Speech and Dialogue (TSD2023)

  16. arXiv:2306.09426  [pdf, other]

    eess.IV cs.CV cs.LG

    Deep learning techniques for blind image super-resolution: A high-scale multi-domain perspective evaluation

    Authors: Valdivino Alexandre de Santiago Júnior

    Abstract: Although several solutions and experiments addressing image super-resolution (SR), boosted by deep learning (DL) techniques, have been conducted recently, they do not usually design evaluations with high scaling factors, capping them at 2x or 4x. Moreover, the datasets are generally benchmarks that do not truly encompass a significant diversity of domains to properly evaluate the techniques. It is also i… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 21 pages

  17. arXiv:2305.14580  [pdf, other]

    cs.CL cs.AI

    Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person

    Authors: Lucas Rafael Stefanel Gris, Ricardo Marcacini, Arnaldo Candido Junior, Edresson Casanova, Anderson Soares, Sandra Maria Aluísio

    Abstract: Automatic speech recognition (ASR) systems play a key role in applications involving human-machine interactions. Despite their importance, ASR models for the Portuguese language proposed in the last decade have limitations in relation to the correct identification of punctuation marks in automatic transcriptions, which hinder the use of transcriptions by other systems, models, and even by humans.… ▽ More

    Submitted 26 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  18. arXiv:2304.04966  [pdf]

    cs.CV

    Computer Vision-Aided Intelligent Monitoring of Coffee: Towards Sustainable Coffee Production

    Authors: Francisco Eron, Muhammad Noman, Raphael Ricon de Oliveira, Deigo de Souza Marques, Rafael Serapilha Durelli, Andre Pimenta Freire, Antonio Chalfun Junior

    Abstract: Coffee, which is prepared from the ground roasted seeds of harvested coffee cherries, is one of the most consumed beverages and traded commodities globally. Manually monitoring the coffee field regularly to report on plant and soil health and to estimate yield and harvesting time is labor-intensive, time-consuming, and error-prone. Some recent studies have developed sensors for estimating… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  19. arXiv:2304.04833  [pdf, other]

    cs.DC

    A visão da BBChain sobre o contexto tecnológico subjacente à adoção do Real Digital (BBChain's view of the technological context underlying the adoption of the Real Digital)

    Authors: Marcio G B de Avellar, Alexandre A S Junior, André H G Lopes, André L S Carneiro, João A Pereira, Davi C B D da Cunha

    Abstract: We explore confidential computing in the context of CBDCs using Microsoft's CCF framework as an example. By developing an experiment and comparing different approaches and performance and security metrics, we seek to evaluate the effectiveness of confidential computing to improve the privacy, security, and performance of CBDCs. Preliminary results suggest that confidential computing could be a pro… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 11 pages, 8 figures, in (Brazilian) Portuguese

  20. arXiv:2211.14372  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Interpretability Analysis of Deep Models for COVID-19 Detection

    Authors: Daniel Peixoto Pinto da Silva, Edresson Casanova, Lucas Rafael Stefanel Gris, Arnaldo Candido Junior, Marcelo Finger, Flaviane Svartman, Beatriz Raposo, Marcus Vinícius Moreira Martins, Sandra Maria Aluísio, Larissa Cristina Berti, João Paulo Teixeira

    Abstract: During the outbreak of the COVID-19 pandemic, several research areas joined efforts to mitigate the damage caused by SARS-CoV-2. In this paper we present an interpretability analysis of a convolutional neural network based model for COVID-19 detection in audio. We investigate which features are important for the model's decision process, investigating spectrograms, F0, F0 standard deviation, sex and age.… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 14 pages, 4 figures

  21. arXiv:2210.07852  [pdf, other]

    cs.CL cs.SD eess.AS

    Bringing NURC/SP to Digital Life: the Role of Open-source Automatic Speech Recognition Models

    Authors: Lucas Rafael Stefanel Gris, Arnaldo Candido Junior, Vinícius G. dos Santos, Bruno A. Papa Dias, Marli Quadros Leite, Flaviane Romani Fernandes Svartman, Sandra Aluísio

    Abstract: The NURC Project, which started in 1969 to study the cultured linguistic urban norm spoken in five Brazilian capitals, was responsible for compiling a large corpus for each capital. The digitized NURC/SP comprises 375 inquiries in 334 hours of recordings taken in São Paulo capital. Although 47 inquiries have transcripts, there was no alignment between audio and transcription, and 328 inquiries were… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  22. Generalizing intrusion detection for heterogeneous networks: A stacked-unsupervised federated learning approach

    Authors: Gustavo de Carvalho Bertoli, Lourenço Alves Pereira Junior, Aldri Luiz dos Santos, Osamu Saotome

    Abstract: The constantly evolving digital transformation imposes new requirements on our society. Aspects relating to reliance on the networking domain and the difficulty of achieving security by design pose a challenge today. As a result, data-centric and machine-learning approaches arose as feasible solutions for securing large networks. Although, in the network security domain, ML-based solutions face a… ▽ More

    Submitted 28 November, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: Preprint (Under revision), 35 pages. Added repository link, see https://github.com/c2dc/fl-unsup-nids

  23. arXiv:2206.08537  [pdf, ps, other]

    cs.CV cs.LG

    Large-Margin Representation Learning for Texture Classification

    Authors: Jonathan de Matos, Luiz Eduardo Soares de Oliveira, Alceu de Souza Britto Junior, Alessandro Lameiras Koerich

    Abstract: This paper presents a novel approach combining convolutional layers (CLs) and large-margin metric learning for training supervised models on small datasets for texture classification. The core of such an approach is a loss function that computes the distances between instances of interest and support vectors. The objective is to update the weights of CLs iteratively to learn a representation with… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 7 pages

  24. arXiv:2206.01604  [pdf, other]

    cs.LG nlin.CD

    Non-Intrusive Reduced Models based on Operator Inference for Chaotic Systems

    Authors: João Lucas de Sousa Almeida, Arthur Cancellieri Pires, Klaus Feine Vaz Cid, Alberto Costa Nogueira Junior

    Abstract: This work explores the physics-driven machine learning technique Operator Inference (OpInf) for predicting the state of chaotic dynamical systems. OpInf provides a non-intrusive approach to infer approximations of polynomial operators in reduced space without having access to the full order operators appearing in discretized models. Datasets for the physics systems are generated using conventional… ▽ More

    Submitted 21 September, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: 16 pages, 37 figures, accepted for publication in the IEEE-TAI-PIML

  25. arXiv:2205.12746   

    q-fin.CP cs.LG econ.EM q-fin.PR stat.ML

    Machine learning method for return direction forecasting of Exchange Traded Funds using classification and regression models

    Authors: Raphael P. B. Piovezan, Pedro Paulo de Andrade Junior

    Abstract: This article aims to propose and apply a machine learning method to analyze the direction of returns from Exchange Traded Funds (ETFs) using the historical return data of its components, helping to make investment strategy decisions through a trading algorithm. In methodological terms, regression and classification models were applied, using standard datasets from Brazilian and American markets, i… ▽ More

    Submitted 13 June, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Co-author did not agree with publishing here

  26. arXiv:2204.00618  [pdf, other]

    eess.AS cs.CL cs.SD

    ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion

    Authors: Edresson Casanova, Christopher Shulby, Alexander Korolev, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Aluísio, Moacir Antonelli Ponti

    Abstract: We explore cross-lingual multi-speaker speech synthesis and cross-lingual voice conversion applied to data augmentation for automatic speech recognition (ASR) systems in low/medium-resource scenarios. Through extensive experiments, we show that our approach permits the application of speech synthesis and voice conversion to improve ASR systems using only one target-language speaker during model tr… ▽ More

    Submitted 20 May, 2023; v1 submitted 29 March, 2022; originally announced April 2022.

    Comments: This paper was accepted at INTERSPEECH 2023

  27. A Review of Deep Learning-based Approaches for Deepfake Content Detection

    Authors: Leandro A. Passos, Danilo Jodas, Kelton A. P. da Costa, Luis A. Souza Júnior, Douglas Rodrigues, Javier Del Ser, David Camacho, João Paulo Papa

    Abstract: Recent advancements in deep learning generative models have raised concerns as they can create highly convincing counterfeit images and videos. This poses a threat to people's integrity and can lead to social instability. To address this issue, there is a pressing need to develop new computational models that can efficiently detect forged content and alert users to potential image and video manipu… ▽ More

    Submitted 15 February, 2024; v1 submitted 12 February, 2022; originally announced February 2022.

  28. arXiv:2112.02418  [pdf, other]

    cs.SD cs.CL eess.AS

    YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

    Authors: Edresson Casanova, Julian Weber, Christopher Shulby, Arnaldo Candido Junior, Eren Gölge, Moacir Antonelli Ponti

    Abstract: YourTTS brings the power of a multilingual approach to the task of zero-shot multi-speaker TTS. Our method builds upon the VITS model and adds several novel modifications for zero-shot multi-speaker and multilingual training. We achieved state-of-the-art (SOTA) results in zero-shot multi-speaker TTS and results comparable to SOTA in zero-shot voice conversion on the VCTK dataset. Additionally, our… ▽ More

    Submitted 30 April, 2023; v1 submitted 4 December, 2021; originally announced December 2021.

    Comments: An Erratum was added on the last page of this paper

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:2709-2720, 2022

  29. arXiv:2111.12126  [pdf, other]

    cs.CV cs.AI cs.DB

    Panoptic Segmentation Meets Remote Sensing

    Authors: Osmar Luiz Ferreira de Carvalho, Osmar Abílio de Carvalho Júnior, Cristiano Rosa e Silva, Anesmar Olino de Albuquerque, Nickolas Castro Santana, Dibio Leandro Borges, Roberto Arnaldo Trancoso Gomes, Renato Fontes Guimarães

    Abstract: Panoptic segmentation combines instance and semantic predictions, allowing the detection of "things" and "stuff" simultaneously. Effectively approaching panoptic segmentation in remotely sensed data can be auspicious in many challenging problems since it allows continuous mapping and specific target counting. Several difficulties have prevented the growth of this task in remote sensing: (a) most a… ▽ More

    Submitted 30 November, 2021; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: 40 pages, 10 figures, submitted to journal

    MSC Class: I.4.6

  30. Bounding Box-Free Instance Segmentation Using Semi-Supervised Learning for Generating a City-Scale Vehicle Dataset

    Authors: Osmar Luiz Ferreira de Carvalho, Osmar Abílio de Carvalho Júnior, Anesmar Olino de Albuquerque, Nickolas Castro Santana, Dibio Leandro Borges, Roberto Arnaldo Trancoso Gomes, Renato Fontes Guimarães

    Abstract: Vehicle classification is a hot computer vision topic, with studies ranging from ground-view up to top-view imagery. In remote sensing, the usage of top-view images allows for understanding city patterns, vehicle concentration, traffic management, and others. However, there are some difficulties when aiming for pixel-wise classification: (a) most vehicle classification studies use object detection… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 38 pages, 10 figures, submitted to journal

    MSC Class: I.4.6

  31. arXiv:2111.06161  [pdf, other]

    cs.NI cs.LG cs.SI

    Understanding mobility in networks: A node embedding approach

    Authors: Matheus F. C. Barros, Carlos H. G. Ferreira, Bruno Pereira dos Santos, Lourenço A. P. Júnior, Marco Mellia, Jussara M. Almeida

    Abstract: Motivated by the growing number of mobile devices capable of connecting and exchanging messages, we propose a methodology aiming to model and analyze node mobility in networks. We note that many existing solutions in the literature rely on topological measurements calculated directly on the graph of node contacts, aiming to capture the notion of the node's importance in terms of connectivity and m… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

  32. arXiv:2110.15731  [pdf, other]

    cs.CL cs.SD eess.AS

    CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese

    Authors: Arnaldo Candido Junior, Edresson Casanova, Anderson Soares, Frederico Santos de Oliveira, Lucas Oliveira, Ricardo Corso Fernandes Junior, Daniel Peixoto Pinto da Silva, Fernando Gorgulho Fayet, Bruno Baldissera Carlotto, Lucas Rafael Stefanel Gris, Sandra Maria Aluísio

    Abstract: Automatic speech recognition (ASR) is a complex and challenging task. In recent years, there have been significant advances in the area. In particular, for the Brazilian Portuguese (BP) language, there were about 376 hours publicly available for the ASR task until the second half of 2020. With the release of new datasets in early 2021, this number increased to 574 hours. The existing resources, however,… ▽ More

    Submitted 18 November, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: This paper is under consideration at Language Resources and Evaluation (LREV)

  33. arXiv:2110.13655  [pdf, other]

    cs.CR cs.AI cs.LG

    Bridging the gap to real-world for network intrusion detection systems with data-centric approach

    Authors: Gustavo de Carvalho Bertoli, Lourenço Alves Pereira Junior, Filipe Alves Neto Verri, Aldri Luiz dos Santos, Osamu Saotome

    Abstract: Most research using machine learning (ML) for network intrusion detection systems (NIDS) uses well-established datasets such as KDD-CUP99, NSL-KDD, UNSW-NB15, and CICIDS-2017. In this context, the possibilities of machine learning techniques are explored, aiming for metrics improvements compared to the published baselines (model-centric approach). However, those datasets present some limitations a… ▽ More

    Submitted 8 January, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: Camera-ready version from Data-centric AI workshop at NeurIPS 2021, see https://datacentricai.org/papers/104_CameraReady_dcaicamera-ready.pdf

  34. arXiv:2107.14235  [pdf, other]

    q-bio.NC cs.LG

    EEG multipurpose eye blink detector using convolutional neural network

    Authors: Amanda Ferrari Iaquinta, Ana Carolina de Sousa Silva, Aldrumont Ferraz Júnior, Jessica Monique de Toledo, Gustavo Voltani von Atzingen

    Abstract: The electrical signal emitted by eye movement produces a very strong artifact on the EEG signal due to its close proximity to the sensors and abundance of occurrence. In the context of detecting eye blink artifacts in EEG waveforms for further removal and signal purification, multiple strategies were proposed in the literature. Most commonly applied methods require the use of a large number of elect… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

  35. arXiv:2107.11414  [pdf, other]

    cs.CL

    Brazilian Portuguese Speech Recognition Using Wav2vec 2.0

    Authors: Lucas Rafael Stefanel Gris, Edresson Casanova, Frederico Santos de Oliveira, Anderson da Silva Soares, Arnaldo Candido Junior

    Abstract: Deep learning techniques have been shown to be efficient in various tasks, especially in the development of speech recognition systems, that is, systems that aim to transcribe an audio sentence in a sequence of written words. Despite the progress in the area, speech recognition can still be considered difficult, especially for languages lacking available data, such as Brazilian Portuguese (BP). In… ▽ More

    Submitted 22 December, 2021; v1 submitted 23 July, 2021; originally announced July 2021.

  36. arXiv:2106.08499  [pdf, other]

    cs.CV cs.AI cs.LG

    ICDAR 2021 Competition on Components Segmentation Task of Document Photos

    Authors: Celso A. M. Lopes Junior, Ricardo B. das Neves Junior, Byron L. D. Bezerra, Alejandro H. Toselli, Donato Impedovo

    Abstract: This paper describes the short-term competition on the Components Segmentation Task of Document Photos that was prepared in the context of the 16th International Conference on Document Analysis and Recognition (ICDAR 2021). This competition aims to bring together researchers working in the field of identification document image processing and provides them a suitable benchmark to compare their tec… ▽ More

    Submitted 8 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 15 pages; 5 figures; Accepted at ICDAR 2021: 16th International Conference on Document Analysis and Recognition

  37. arXiv:2104.05557  [pdf, other]

    eess.AS cs.SD

    SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

    Authors: Edresson Casanova, Christopher Shulby, Eren Gölge, Nicolas Michael Müller, Frederico Santos de Oliveira, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Maria Aluisio, Moacir Antonelli Ponti

    Abstract: In this paper, we propose SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model that improves similarity for speakers unseen during training. We propose a speaker-conditional architecture that explores a flow-based decoder that works in a zero-shot scenario. As text encoders, we explore a dilated residual convolutional-based encoder, gated convolutional-based encoder, and transform… ▽ More

    Submitted 15 June, 2021; v1 submitted 2 April, 2021; originally announced April 2021.

    Comments: Accepted at Interspeech 2021

  38. arXiv:2008.07965  [pdf, other]

    cs.AI cs.LG cs.RO

    Analysis of Social Robotic Navigation approaches: CNN Encoder and Incremental Learning as an alternative to Deep Reinforcement Learning

    Authors: Janderson Ferreira, Agostinho A. F. Júnior, Letícia Castro, Yves M. Galvão, Pablo Barros, Bruno J. T. Fernandes

    Abstract: Dealing with social tasks in robotic scenarios is difficult, as having humans in the learning loop is incompatible with most of the state-of-the-art machine learning algorithms. This is the case when exploring Incremental learning models, in particular the ones involving reinforcement learning. In this work, we discuss this problem and possible solutions by analysing a previous study on adaptive c…

    Submitted 5 September, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

  39. arXiv:2008.02254  [pdf, other]

    cs.CV cs.AI cs.LG

    Performance Improvement of Path Planning algorithms with Deep Learning Encoder Model

    Authors: Janderson Ferreira, Agostinho A. F. Júnior, Yves M. Galvão, Pablo Barros, Sergio Murilo Maciel Fernandes, Bruno J. T. Fernandes

    Abstract: Currently, path planning algorithms are used in many daily tasks. They are relevant to find the best route in traffic and make autonomous robots able to navigate. The use of path planning presents some issues in large and dynamic environments. Large environments make these algorithms spend much time finding the shortest path. On the other hand, dynamic environments require a new execution of the a…

    Submitted 5 August, 2020; originally announced August 2020.

  40. arXiv:2005.14229  [pdf, other]

    cs.CV

    FCN+RL: A Fully Convolutional Network followed by Refinement Layers to Offline Handwritten Signature Segmentation

    Authors: Celso A. M. Lopes Junior, Matheus Henrique M. da Silva, Byron Leite Dantas Bezerra, Bruno Jose Torres Fernandes, Donato Impedovo

    Abstract: Although secular, the handwritten signature is one of the most reliable biometric methods used by most countries. In the last ten years, the application of technology for verification of handwritten signatures has evolved strongly, including forensic aspects. Some factors, such as the complexity of the background and the small size of the region of interest - signature pixels - increase the difficulty…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: 7 pages, 6 figures, Accepted at IJCNN 2020: International Joint Conference on Neural Networks

  41. arXiv:2005.08125  [pdf, other]

    physics.soc-ph cs.SI q-bio.PE

    Social Interaction Layers in Complex Networks for the Dynamical Epidemic Modeling of COVID-19 in Brazil

    Authors: Leonardo F. S. Scabini, Lucas C. Ribas, Mariane B. Neiva, Altamir G. B. Junior, Alex J. F. Farfán, Odemir M. Bruno

    Abstract: We are currently living in a state of uncertainty due to the pandemic caused by the SARS-CoV-2 virus. There are several factors involved in the epidemic spreading such as the individual characteristics of each city/country. The true shape of the epidemic dynamics is a large, complex system, like most social systems. In this context, complex networks are a great candidate to analyze these…

    Submitted 20 May, 2020; v1 submitted 16 May, 2020; originally announced May 2020.

    Comments: 16 pages, 7 figures, 2 tables

    MSC Class: 05C82 (Primary) 05C81; 92C60; 37M05 (Secondary)

  42. arXiv:2005.05144  [pdf, other]

    eess.AS cs.CL cs.LG

    TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese

    Authors: Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, João Paulo Teixeira, Moacir Antonelli Ponti, Sandra Maria Aluisio

    Abstract: Speech provides a natural way for human-computer interaction. In particular, speech synthesis systems are popular in different applications, such as personal assistants, GPS applications, screen readers and accessibility tools. However, not all languages are on the same level in terms of resources and systems for speech synthesis. This work consists of creating publicly available resources fo…

    Submitted 29 January, 2022; v1 submitted 11 May, 2020; originally announced May 2020.

  43. arXiv:2004.12554  [pdf, other]

    cs.LG cs.AI cs.CE stat.ML

    Forecasting in Non-stationary Environments with Fuzzy Time Series

    Authors: Petrônio Cândido de Lima e Silva, Carlos Alberto Severiano Junior, Marcos Antonio Alves, Rodrigo Silva, Miri Weiss Cohen, Frederico Gadelha Guimarães

    Abstract: In this paper we introduce a Non-Stationary Fuzzy Time Series (NSFTS) method with time varying parameters adapted from the distribution of the data. In this approach, we employ Non-Stationary Fuzzy Sets, in which perturbation functions are used to adapt the membership function parameters in the knowledge base in response to statistical changes in the time series. The proposed method is capable of…

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: 21 pages, 7 figures, submitted to Applied Soft Computing

  44. arXiv:2004.05077  [pdf, ps, other]

    cs.AI cs.CV eess.IV

    CNN Encoder to Reduce the Dimensionality of Data Image for Motion Planning

    Authors: Janderson Ferreira, Agostinho A. F. Júnior, Yves M. Galvão, Bruno J. T. Fernandes, Pablo Barros

    Abstract: Many real-world applications need path planning algorithms to solve tasks in different areas, such as social applications, autonomous cars, tracking activities, and, most importantly, motion planning. Although the use of path planning algorithms is sufficient in most motion planning scenarios, they represent potential bottlenecks in large environments with dynamic changes. To tackle this problem, the numbe…

    Submitted 10 April, 2020; originally announced April 2020.

  45. arXiv:2002.11213  [pdf, other]

    cs.CL cs.SD eess.AS

    Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models

    Authors: Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, Lucas Rafael Stefanel Gris, Hamilton Pereira da Silva, Sandra Maria Aluisio, Moacir Antonelli Ponti

    Abstract: In this paper we present an efficient method for training models for speaker recognition using small or under-resourced datasets. This method requires less data than other SOTA (State-Of-The-Art) methods, e.g. the Angular Prototypical and GE2E loss functions, while achieving similar results to those methods. This is done using the knowledge of the reconstruction of a phoneme in the speaker's voice…

    Submitted 18 June, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Submitted to BRACIS

  46. arXiv:1911.07673  [pdf, other]

    cs.DB cs.AI cs.IR

    Using Mapping Languages for Building Legal Knowledge Graphs from XML Files

    Authors: Ademar Crotti Junior, Fabrizio Orlandi, Declan O'Sullivan, Christian Dirschl, Quentin Reul

    Abstract: This paper presents our experience in building RDF knowledge graphs for an industrial use case in the legal domain. The information contained in legal information systems is often accessed through simple keyword interfaces and presented as a simple list of hits. In order to improve search accuracy one may avail of knowledge graphs, where the semantics of the data can be made explicit. Significant…

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: Presented at the 2nd International Contextualized Knowledge Graphs Workshop (CKG'19) at the 18th International Semantic Web Conference (ISWC) 2019

  47. arXiv:1910.06155  [pdf]

    cs.OH stat.AP

    GeoSES -- um Índice Socioeconômico para Estudos de Saúde no Brasil

    Authors: Ligia Vizeu Barrozo, Michel Fornaciali, Carmen Diva Saldiva de André, Guilherme Augusto Zimeo Morais, Giselle Mansur, William Cabral-Miranda, João Ricardo Sato, Edson Amaro Júnior

    Abstract: Objective: to define an index that summarizes the main dimensions of the socioeconomic context for research purposes, evaluation and monitoring health inequalities. Methods: the index was created from the 2010 Brazilian Demographic Census, whose variables selection was guided by theoretical references for health studies, including seven socioeconomic dimensions: education, mobility, poverty, wealt…

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: in Portuguese

  48. arXiv:1909.09978  [pdf, other]

    cs.LG stat.ML

    Minimal Learning Machine: Theoretical Results and Clustering-Based Reference Point Selection

    Authors: Joonas Hämäläinen, Alisson S. C. Alencar, Tommi Kärkkäinen, César L. C. Mattos, Amauri H. Souza Júnior, João P. P. Gomes

    Abstract: The Minimal Learning Machine (MLM) is a nonlinear supervised approach based on learning a linear mapping between distance matrices computed in the input and output data spaces, where distances are calculated using a subset of points called reference points. Its simple formulation has attracted several recent works on extensions and applications. In this paper, we aim to address some open questions…

    Submitted 6 October, 2020; v1 submitted 22 September, 2019; originally announced September 2019.

    Comments: 29 pages, Accepted to JMLR

  49. arXiv:1908.10980  [pdf]

    cs.NI

    vSDNEmul: A Software-Defined Network Emulator Based on Container Virtualization

    Authors: Fernando N. N. Farias, Antônio de O. Junior, Leonardo B. da Costa, Billy A. Pinheiro, Antônio J. G. Abelém

    Abstract: The main issue related to Software-Defined Network emulators is how to replicate real behavior in experiments. Mininet and other SDN emulators have an architecture that limits both the scope of experiments and the fidelity of networking tests. Consequently, the serialization, contention, and load of background processes may produce delays that compromise the operation of events such as transmitti…

    Submitted 28 August, 2019; originally announced August 2019.

    Journal ref: International Journal of Simulation Systems, Science & Technology Volume 20, Number 4, August 2019

  50. arXiv:1908.08651  [pdf]

    cs.CY

    Trajectory-Based Urban Air Mobility (UAM) Operations Simulator (TUS)

    Authors: Euclides C. Pinto Neto, Derick M. Baum, Jorge Rady de Almeida Junior, João Batista Camargo Junior, Paulo Sérgio Cugnasca

    Abstract: Nowadays, the demand for optimized services in urban environments to provide better society wellness is increasing. In this sense, ground transportation in dense urban environments has been facing challenges for many years (e.g., congestion and resilience). One important outcome of the effort made toward the creation of new concepts for enhancing urban transportation is the Urban Air Mobility (UAM) c…

    Submitted 22 August, 2019; originally announced August 2019.