Showing 1–18 of 18 results for author: Oliveira, F S

  1. arXiv:2310.16148  [pdf, other]

    cs.CV cs.AI

    Yin Yang Convolutional Nets: Image Manifold Extraction by the Analysis of Opposites

    Authors: Augusto Seben da Rosa, Frederico Santos de Oliveira, Anderson da Silva Soares, Arnaldo Candido Junior

    Abstract: Computer vision in general has seen several advances, such as training optimizations and new architectures (pure attention, efficient block, vision language models, generative models, among others). These have improved performance in several tasks, such as classification, among others. However, the majority of these models focus on modifications that are moving away from realistic neuroscientific app…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 12 pages, 5 tables and 6 figures

    ACM Class: I.2.10

  2. arXiv:2306.10097  [pdf, other]

    eess.AS cs.AI cs.CL

    CML-TTS: A Multilingual Dataset for Speech Synthesis in Low-Resource Languages

    Authors: Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Anderson S. Soares, Arlindo R. Galvão Filho

    Abstract: In this paper, we present CML-TTS, a recursive acronym for CML-Multi-Lingual-TTS, a new Text-to-Speech (TTS) dataset developed at the Center of Excellence in Artificial Intelligence (CEIA) of the Federal University of Goias (UFG). CML-TTS is based on Multilingual LibriSpeech (MLS) and adapted for training TTS models, consisting of audiobooks in seven languages: Dutch, French, German, Italian, Port…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 12 pages, 5 figures, Accepted at the 25th International Conference on Text, Speech and Dialogue (TSD 2022)

  3. arXiv:2306.09979  [pdf, other]

    cs.SD cs.AI eess.AS

    Evaluation of Speech Representations for MOS prediction

    Authors: Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Lucas R. S. Gris, Anderson S. Soares, Arlindo R. Galvão Filho

    Abstract: In this paper, we evaluate feature extraction models for predicting speech quality. We also propose a model architecture to compare embeddings of supervised learning and self-supervised learning models with embeddings of speaker verification models to predict the MOS metric. Our experiments were performed on the VCC2018 dataset and a Brazilian-Portuguese dataset called BRSpeechMOS, which was creat…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 12 pages, 4 figures, Accepted to the 26th International Conference on Text, Speech and Dialogue (TSD 2023)

  4. arXiv:2303.06070  [pdf, ps, other]

    math.CO cs.DM

    Thinness and its variations on some graph families and coloring graphs of bounded thinness

    Authors: Flavia Bonomo-Braberman, Eric Brandwein, Fabiano S. Oliveira, Moysés S. Sampaio Jr., Agustin Sansone, Jayme L. Szwarcfiter

    Abstract: Interval graphs and proper interval graphs are well-known graph classes, for which several generalizations have been proposed in the literature. In this work, we study the (proper) thinness, and several variations, for the classes of cographs, crown graphs and grid graphs. We provide the exact values for several variants of thinness (proper, independent, complete, precedence, and combinations o…

    Submitted 2 February, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

    MSC Class: 05C15; 05C62; 05C75

  5. arXiv:2206.11846  [pdf, other]

    cs.SI

    Analysis of account behaviors in Ethereum during an economic impact event

    Authors: Pedro Henrique F. S. Oliveira, Daniel Muller Rezende, Heder Soares Bernardino, Saulo Moraes Villela, Alex Borges Vieira

    Abstract: One of the main events involving the world economy in 2022 is the conflict between Russia and Ukraine. This event offers a rare opportunity to analyze how events of this magnitude can affect the use of cryptocurrencies. This work aims to investigate the behavior of accounts and their transactions on the Ethereum cryptocurrency during this event. To this end, we collected all transactions that…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 13 pages, 5 figures

  6. arXiv:2203.04250  [pdf, other]

    cs.DM cs.DS

    Edge Intersection Graphs of Paths on a Triangular Grid

    Authors: Vitor T. F. de Luca, María Pía Mazzoleni, Fabiano S. Oliveira, Tanilson D. Santos, Jayme L. Szwarcfiter

    Abstract: We introduce a new class of intersection graphs, the edge intersection graphs of paths on a triangular grid, called EPGt graphs. We show similarities and differences from this new class to the well-known class of EPG graphs. A turn of a path at a grid point is called a bend. An EPGt representation in which every path has at most $k$ bends is called a B$_k$-EPGt representation and the corresponding…

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: 19 pages, 12 figures

    MSC Class: 05C85; 05C05

  7. arXiv:2202.13955  [pdf, other]

    cs.CC cs.DM math.CO

    MaxCut on Permutation Graphs is NP-complete

    Authors: Celina M. H. de Figueiredo, Alexsander A. de Melo, Fabiano S. Oliveira, Ana Silva

    Abstract: In this paper, we prove that the MaxCut problem is NP-complete on permutation graphs, settling a long-standing open problem that appeared in the 1985 column of the "Ongoing Guide to NP-completeness" by David S. Johnson.

    Submitted 28 February, 2022; originally announced February 2022.

  8. arXiv:2110.15731  [pdf, other]

    cs.CL cs.SD eess.AS

    CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese

    Authors: Arnaldo Candido Junior, Edresson Casanova, Anderson Soares, Frederico Santos de Oliveira, Lucas Oliveira, Ricardo Corso Fernandes Junior, Daniel Peixoto Pinto da Silva, Fernando Gorgulho Fayet, Bruno Baldissera Carlotto, Lucas Rafael Stefanel Gris, Sandra Maria Aluísio

    Abstract: Automatic speech recognition (ASR) is a complex and challenging task. In recent years, there have been significant advances in the area. In particular, for the Brazilian Portuguese (BP) language, about 376 hours were publicly available for the ASR task until the second half of 2020. With the release of new datasets in early 2021, this number increased to 574 hours. The existing resources, however,…

    Submitted 18 November, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: This paper is under consideration at Language Resources and Evaluation (LREV)

  9. arXiv:2109.02733  [pdf, other]

    math.CO cs.CC cs.DM

    Minimum Number of Bends of Paths of Trees in a Grid Embedding

    Authors: V. T. F. Luca, F. S. Oliveira, J. L. Szwarcfiter

    Abstract: We are interested in embedding trees T with maximum degree at most four in a rectangular grid, such that the vertices of T correspond to grid points, while edges of T correspond to non-intersecting straight segments of the grid lines. Such embeddings are called straight models. While each edge is represented by a straight segment, a path of T is represented in the model by the union of the segment…

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: 10 pages, 6 figures

    MSC Class: 05C85; 05C05

  10. arXiv:2107.11414  [pdf, other]

    cs.CL

    Brazilian Portuguese Speech Recognition Using Wav2vec 2.0

    Authors: Lucas Rafael Stefanel Gris, Edresson Casanova, Frederico Santos de Oliveira, Anderson da Silva Soares, Arnaldo Candido Junior

    Abstract: Deep learning techniques have been shown to be efficient in various tasks, especially in the development of speech recognition systems, that is, systems that aim to transcribe an audio sentence into a sequence of written words. Despite the progress in the area, speech recognition can still be considered difficult, especially for languages lacking available data, such as Brazilian Portuguese (BP). In…

    Submitted 22 December, 2021; v1 submitted 23 July, 2021; originally announced July 2021.

  11. arXiv:2106.05312  [pdf, other]

    math.CO cs.DM

    B1-EPG representations using block-cutpoint trees

    Authors: V. T. F. Luca, F. S. Oliveira, J. L. Szwarcfiter

    Abstract: In this paper, we are interested in the edge intersection graphs of paths of a grid where each path has at most one bend, called B1-EPG graphs and first introduced by Golumbic et al. (2009). We also consider a proper subclass of B1-EPG, the L-EPG graphs, which allows paths only in "L" shape. We show that two superclasses of trees are B1-EPG (one of them being the cactus graphs). On the other hand…

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: 9 pages, 13 figures

    MSC Class: 05Cxx

  12. arXiv:2104.05557  [pdf, other]

    eess.AS cs.SD

    SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

    Authors: Edresson Casanova, Christopher Shulby, Eren Gölge, Nicolas Michael Müller, Frederico Santos de Oliveira, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Maria Aluisio, Moacir Antonelli Ponti

    Abstract: In this paper, we propose SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model that improves similarity for speakers unseen during training. We propose a speaker-conditional architecture that explores a flow-based decoder that works in a zero-shot scenario. As text encoders, we explore a dilated residual convolutional-based encoder, gated convolutional-based encoder, and transform…

    Submitted 15 June, 2021; v1 submitted 2 April, 2021; originally announced April 2021.

    Comments: Accepted at Interspeech 2021

  13. arXiv:2012.09804  [pdf, other]

    cs.CC cs.DM

    Maximum cut on interval graphs of interval count four is NP-complete

    Authors: Celina M. H. de Figueiredo, Alexsander A. de Melo, Fabiano S. Oliveira, Ana Silva

    Abstract: The computational complexity of the MaxCut problem restricted to interval graphs has been open since the 1980s, being one of the problems proposed by Johnson in his Ongoing Guide to NP-completeness, and has been settled as NP-complete only recently by Adhikary, Bose, Mukherjee and Roy. On the other hand, many flawed proofs of polynomiality for MaxCut on the more restrictive class of unit/proper int…

    Submitted 29 November, 2022; v1 submitted 17 December, 2020; originally announced December 2020.

    MSC Class: 68Q17; 68Q25; 68R10; 05C62

    ACM Class: F.2.2

  14. Precedence thinness in graphs

    Authors: Flavia Bonomo-Braberman, Fabiano S. Oliveira, Moysés S. Sampaio Jr., Jayme L. Szwarcfiter

    Abstract: Interval and proper interval graphs are very well-known graph classes, for which there is a wide literature. As a consequence, some generalizations of interval graphs have been proposed, in which graphs in general are expressed in terms of $k$ interval graphs, by splitting the graph in some special way. As a recent example of such an approach, the classes of $k$-thin and proper $k$-thin graphs h…

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: 33 pages

    MSC Class: 05C75; 05C85

    ACM Class: G.2.2

    Journal ref: Discrete Applied Mathematics 323 (2022), 76-95

  15. Thinness of product graphs

    Authors: Flavia Bonomo-Braberman, Carolina L. Gonzalez, Fabiano S. Oliveira, Moysés S. Sampaio Jr., Jayme L. Szwarcfiter

    Abstract: The thinness of a graph is a width parameter that generalizes some properties of interval graphs, which are exactly the graphs of thinness one. Many NP-complete problems can be solved in polynomial time for graphs with bounded thinness, given a suitable representation of the graph. In this paper we study the thinness and its variations of graph products. We show that the thinness behaves "well" in…

    Submitted 16 April, 2021; v1 submitted 30 June, 2020; originally announced June 2020.

    Comments: 45 pages. arXiv admin note: text overlap with arXiv:1704.00379

    MSC Class: 05C76

    ACM Class: G.2.2

    Journal ref: Discrete Applied Mathematics 312 (2022), 52-71

  16. arXiv:2005.05144  [pdf, other]

    eess.AS cs.CL cs.LG

    TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese

    Authors: Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, João Paulo Teixeira, Moacir Antonelli Ponti, Sandra Maria Aluisio

    Abstract: Speech provides a natural way for human-computer interaction. In particular, speech synthesis systems are popular in different applications, such as personal assistants, GPS applications, screen readers and accessibility tools. However, not all languages are on the same level in terms of resources and systems for speech synthesis. This work consists of creating publicly available resources fo…

    Submitted 29 January, 2022; v1 submitted 11 May, 2020; originally announced May 2020.

  17. Linear-time Algorithms for Eliminating Claws in Graphs

    Authors: Flavia Bonomo-Braberman, Julliano R. Nascimento, Fabiano S. Oliveira, Uéverton S. Souza, Jayme L. Szwarcfiter

    Abstract: Since many NP-complete graph problems have been shown polynomial-time solvable when restricted to claw-free graphs, we study the problem of determining the distance of a given graph to a claw-free graph, considering vertex elimination as measure. CLAW-FREE VERTEX DELETION (CFVD) consists of determining the minimum number of vertices to be removed from a graph such that the resulting graph is claw-…

    Submitted 12 April, 2020; originally announced April 2020.

    Comments: 20 pages

    Journal ref: International Transactions in Operational Research 31 (2024), 296-315

  18. arXiv:2002.11213  [pdf, other]

    cs.CL cs.SD eess.AS

    Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models

    Authors: Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, Lucas Rafael Stefanel Gris, Hamilton Pereira da Silva, Sandra Maria Aluisio, Moacir Antonelli Ponti

    Abstract: In this paper we present an efficient method for training models for speaker recognition using small or under-resourced datasets. This method requires less data than other SOTA (State-Of-The-Art) methods, e.g. the Angular Prototypical and GE2E loss functions, while achieving similar results to those methods. This is done using the knowledge of the reconstruction of a phoneme in the speaker's voice…

    Submitted 18 June, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Submitted to BRACIS