Skip to main content

Showing 1–4 of 4 results for author: Ratnaparkhe, M

  1. arXiv:2312.14920  [pdf, ps, other

    cs.LG cs.AI

    A Novel Sampled Clustering Algorithm for Rice Phenotypic Data

    Authors: Mithun Singh, Kapil Ahuja, Milind B. Ratnaparkhe

    Abstract: Phenotypic (or Physical) characteristics of plant species are commonly used to perform clustering. In one of our recent works (Shastri et al. (2021)), we used a probabilistically sampled (using pivotal sampling) and spectrally clustered algorithm to group soybean species. These techniques were used to obtain highly accurate clusterings at a reduced cost. In this work, we extend the earlier algorit… ▽ More

    Submitted 12 May, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 31 Pages, 3 Figures, 7 Tables

    MSC Class: 68T01; 68T10 ACM Class: I.2.1; I.5.3

  2. arXiv:2204.11835  [pdf, other

    q-bio.QM cs.AI cs.LG

    A Novel Scalable Apache Spark Based Feature Extraction Approaches for Huge Protein Sequence and their Clustering Performance Analysis

    Authors: Preeti Jha, Aruna Tiwari, Neha Bharill, Milind Ratnaparkhe, Om Prakash Patel, Nilagiri Harshith, Mukkamalla Mounika, Neha Nagendra

    Abstract: Genome sequencing projects are rapidly increasing the number of high-dimensional protein sequence datasets. Clustering a high-dimensional protein sequence dataset using traditional machine learning approaches poses many challenges. Many different feature extraction methods exist and are widely used. However, extracting features from millions of protein sequences becomes impractical because they ar… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  3. arXiv:2009.09028  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Probabilistically Sampled and Spectrally Clustered Plant Genotypes using Phenotypic Characteristics

    Authors: Aditya A. Shastri, Kapil Ahuja, Milind B. Ratnaparkhe, Yann Busnel

    Abstract: Clustering genotypes based upon their phenotypic characteristics is used to obtain diverse sets of parents that are useful in their breeding programs. The Hierarchical Clustering (HC) algorithm is the current standard in clustering of phenotypic data. This algorithm suffers from low accuracy and high computational complexity issues. To address the accuracy challenge, we propose the use of Spectral… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: 16 Pages, 3 Figures, and 6 Tables

    MSC Class: 92B05; 68T09 ACM Class: I.2.1; J.3

  4. arXiv:1810.00398  [pdf

    q-bio.QM cs.LG stat.ML

    Vector Quantized Spectral Clustering applied to Soybean Whole Genome Sequences

    Authors: Aditya A. Shastri, Kapil Ahuja, Milind B. Ratnaparkhe, Aditya Shah, Aishwary Gagrani, Anant Lal

    Abstract: We develop a Vector Quantized Spectral Clustering (VQSC) algorithm that is a combination of Spectral Clustering (SC) and Vector Quantization (VQ) sampling for grouping Soybean genomes. The inspiration here is to use SC for its accuracy and VQ to make the algorithm computationally cheap (the complexity of SC is cubic in-terms of the input size). Although the combination of SC and VQ is not new, the… ▽ More

    Submitted 30 September, 2018; originally announced October 2018.

    Comments: 10 Pages, 3 Tables, 2 Figures

    MSC Class: 68T01; 68T10; 68W40