Showing 1–22 of 22 results for author: Rajan, V

  1. arXiv:2405.04078  [pdf, other]

    cs.LG cs.AI q-bio.QM

    WISER: Weak supervISion and supErvised Representation learning to improve drug response prediction in cancer

    Authors: Kumar Shubham, Aishwarya Jayagopal, Syed Mohammed Danish, Prathosh AP, Vaibhav Rajan

    Abstract: Cancer, a leading cause of death globally, occurs due to genomic changes and manifests heterogeneously across patients. To advance research on personalized treatment strategies, the effectiveness of various drugs on cells derived from cancers ('cell lines') is experimentally determined in laboratory settings. Nevertheless, variations in the distribution of genomic data and drug responses between c…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  2. arXiv:2402.10551  [pdf, other]

    cs.LG q-bio.QM

    Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Information

    Authors: Aishwarya Jayagopal, Hansheng Xue, Ziyang He, Robert J. Walsh, Krishna Kumar Hariprasannan, David Shao Peng Tan, Tuan Zea Tan, Jason J. Pitt, Anand D. Jeyasekharan, Vaibhav Rajan

    Abstract: Cancer remains a global challenge due to its growing clinical and economic burden. Its uniquely personal manifestation, which makes treatment difficult, has fuelled the quest for personalized treatment strategies. Thus, genomic profiling is increasingly becoming part of clinical diagnostic panels. Effective use of such panels requires accurate drug response prediction (DRP) models, which are chall…

    Submitted 16 February, 2024; originally announced February 2024.

  3. arXiv:2402.10229  [pdf, other]

    stat.CO cs.LG

    Mixture-Models: a one-stop Python Library for Model-based Clustering using various Mixture Models

    Authors: Siva Rajesh Kasa, Hu Yijie, Santhosh Kumar Kasa, Vaibhav Rajan

    Abstract: Mixture-Models is an open-source Python library for fitting Gaussian Mixture Models (GMM) and their variants, such as Parsimonious GMMs, Mixture of Factor Analyzers, MClust models, Mixture of Student's t distributions, etc. It streamlines the implementation and analysis of these models using various first/second order optimization routines such as Gradient Descent and Newton-CG through au…

    Submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2401.12085  [pdf, other]

    eess.AS cs.SD

    Consistency Based Unsupervised Self-training For ASR Personalisation

    Authors: Jisi Zhang, Vandana Rajan, Haaris Mehmood, David Tuckey, Pablo Peso Parada, Md Asif Jalal, Karthikeyan Saravanan, Gil Ho Lee, Jungin Lee, Seokyeong Jung

    Abstract: On-device Automatic Speech Recognition (ASR) models trained on speech data of a large population might underperform for individuals unseen during training. This is due to a domain shift between user data and the original training data, differed by user's speaking characteristics and environmental acoustic conditions. ASR personalisation is a solution that aims to exploit user data to improve model…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted for IEEE ASRU 2023

  5. arXiv:2401.03181  [pdf]

    cs.CL

    A Joint-Reasoning based Disease Q&A System

    Authors: Prakash Chandra Sukhwal, Vaibhav Rajan, Atreyi Kankanhalli

    Abstract: Medical question answer (QA) assistants respond to lay users' health-related queries by synthesizing information from multiple sources using natural language processing and related techniques. They can serve as vital tools to alleviate issues of misinformation, information overload, and complexity of medical language, thus addressing lay users' information needs while reducing the burden on health…

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: 36 pages, 6 figures, submitted to TMIS on 14 July 2023 (status: under review)

  6. arXiv:2210.12158  [pdf, other]

    q-bio.GN cs.LG

    Graph Coloring via Neural Networks for Haplotype Assembly and Viral Quasispecies Reconstruction

    Authors: Hansheng Xue, Vaibhav Rajan, Yu Lin

    Abstract: Understanding genetic variation, e.g., through mutations, in organisms is crucial to unravel their effects on the environment and human health. A fundamental characterization can be obtained by solving the haplotype assembly problem, which yields the variation across multiple copies of chromosomes. Variations among fast evolving viruses that lead to different strains (called quasispecies) are also…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022

  7. arXiv:2202.09263  [pdf, other]

    cs.LG cs.MM

    Is Cross-Attention Preferable to Self-Attention for Multi-Modal Emotion Recognition?

    Authors: Vandana Rajan, Alessio Brutti, Andrea Cavallaro

    Abstract: Humans express their emotions via facial expressions, voice intonation and word choices. To infer the nature of the underlying emotion, recognition models may use a single modality, such as vision, audio, and text, or a combination of modalities. Generally, models that fuse complementary information from multiple modalities outperform their uni-modal counterparts. However, a successful model that…

    Submitted 18 February, 2022; originally announced February 2022.

    Comments: Accepted at ICASSP 2022

  8. arXiv:2201.06344  [pdf, other]

    cs.LG

    ExpertNet: A Symbiosis of Classification and Clustering

    Authors: Shivin Srivastava, Kenji Kawaguchi, Vaibhav Rajan

    Abstract: A widely used paradigm to improve the generalization performance of high-capacity neural models is through the addition of auxiliary unsupervised tasks during supervised training. Tasks such as similarity matching and input reconstruction have been shown to provide a beneficial regularizing effect by guiding representation learning. Real data often has complex underlying structures and may be comp…

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: 16 pages, 3 figures

  9. arXiv:2112.11696  [pdf, other]

    q-bio.GN cs.LG cs.SI

    RepBin: Constraint-based Graph Representation Learning for Metagenomic Binning

    Authors: Hansheng Xue, Vijini Mallawaarachchi, Yujia Zhang, Vaibhav Rajan, Yu Lin

    Abstract: Mixed communities of organisms are found in many environments (from the human gut to marine ecosystems) and can have profound impact on human health and the environment. Metagenomics studies the genomic material of such communities through high-throughput sequencing that yields DNA subsequences for subsequent analysis. A fundamental problem in the standard workflow, called binning, is to discover…

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI-2022

  10. arXiv:2109.13164  [pdf, other]

    stat.ML cs.LG

    Multi-way Clustering and Discordance Analysis through Deep Collective Matrix Tri-Factorization

    Authors: Ragunathan Mariappan, Vaibhav Rajan

    Abstract: Heterogeneous multi-typed, multimodal relational data is increasingly available in many domains and their exploratory analysis poses several challenges. We advance the state-of-the-art in neural unsupervised learning to analyze such data. We design the first neural method for collective matrix tri-factorization of arbitrary collections of matrices to perform spectral clustering of all constituent…

    Submitted 27 September, 2021; originally announced September 2021.

  11. arXiv:2108.00597  [pdf, other]

    cs.LG math.OC

    Exact Pareto Optimal Search for Multi-Task Learning and Multi-Criteria Decision-Making

    Authors: Debabrata Mahapatra, Vaibhav Rajan

    Abstract: Given multiple non-convex objective functions and objective-specific weights, Chebyshev scalarization (CS) is a well-known approach to obtain an Exact Pareto Optimal (EPO), i.e., a solution on the Pareto front (PF) that intersects the ray defined by the inverse of the weights. First-order optimizers that use the CS formulation to find EPO solutions encounter practical problems of oscillations and…

    Submitted 17 September, 2023; v1 submitted 1 August, 2021; originally announced August 2021.

  12. arXiv:2102.11872  [pdf, other]

    cs.LG cs.AI

    Clustering Aware Classification for Risk Prediction and Subtyping in Clinical Data

    Authors: Shivin Srivastava, Siddharth Bhatia, Lingxiao Huang, Lim Jun Heng, Kenji Kawaguchi, Vaibhav Rajan

    Abstract: In data containing heterogeneous subpopulations, classification performance benefits from incorporating the knowledge of cluster structure in the classifier. Previous methods for such combined clustering and classification either 1) are classifier-specific and not generic, or 2) independently perform clustering and classifier training, which may not form clusters that can potentially benefit class…

    Submitted 3 January, 2023; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: 19 Pages, 5 figures

  13. arXiv:2102.06371  [pdf, other]

    cs.LG cs.SI

    Multiplex Bipartite Network Embedding using Dual Hypergraph Convolutional Networks

    Authors: Hansheng Xue, Luwei Yang, Vaibhav Rajan, Wen Jiang, Yi Wei, Yu Lin

    Abstract: A bipartite network is a graph structure where nodes are from two distinct domains and only inter-domain interactions exist as edges. A large number of network embedding methods exist to learn vectorial node representations from general graphs with both homogeneous and heterogeneous node and edge types, including some that can specifically model the distinct properties of bipartite networks. Howev…

    Submitted 12 February, 2021; originally announced February 2021.

    Comments: The Web Conference (formerly WWW) 2021

  14. arXiv:2011.01631  [pdf, other]

    cs.LG cs.MM

    Robust Latent Representations via Cross-Modal Translation and Alignment

    Authors: Vandana Rajan, Alessio Brutti, Andrea Cavallaro

    Abstract: Multi-modal learning relates information across observation modalities of the same physical phenomenon to leverage complementary information. Most multi-modal machine learning methods require that all the modalities used for training are also available for testing. This is a limitation when the signals from some modalities are unavailable or are severely degraded by noise. To address this limitati…

    Submitted 8 March, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

    Journal ref: ICASSP 2021

  15. arXiv:2009.05805  [pdf, other]

    cs.LG stat.ML

    Multi-way Spectral Clustering of Augmented Multi-view Data through Deep Collective Matrix Tri-factorization

    Authors: Ragunathan Mariappan, Siva Rajesh Kasa, Vaibhav Rajan

    Abstract: We present the first deep learning based architecture for collective matrix tri-factorization (DCMTF) of arbitrary collections of matrices, also known as augmented multi-view data. DCMTF can be used for multi-way spectral clustering of heterogeneous collections of relational data matrices to discover latent clusters in each input matrix, across both dimensions, as well as the strengths of associat…

    Submitted 24 January, 2022; v1 submitted 12 September, 2020; originally announced September 2020.

  16. arXiv:2007.12786  [pdf, other]

    stat.ML cs.LG stat.CO

    Model-based Clustering using Automatic Differentiation: Confronting Misspecification and High-Dimensional Data

    Authors: Siva Rajesh Kasa, Vaibhav Rajan

    Abstract: We study two practically important cases of model based clustering using Gaussian Mixture Models: (1) when there is misspecification and (2) on high dimensional data, in the light of recent advances in Gradient Descent (GD) based optimization using Automatic Differentiation (AD). Our simulation studies show that EM has better clustering performance, measured by Adjusted Rand Index, compared to GD…

    Submitted 8 July, 2020; originally announced July 2020.

  17. arXiv:1901.02209  [pdf, other]

    cs.DS

    Subset Feedback Vertex Set in Chordal and Split Graphs

    Authors: Geevarghese Philip, Varun Rajan, Saket Saurabh, Prafullkumar Tale

    Abstract: In the Subset Feedback Vertex Set (Subset-FVS) problem the input is a graph $G$, a subset $T$ of vertices of $G$ called the 'terminal' vertices, and an integer $k$. The task is to determine whether there exists a subset of vertices of cardinality at most $k$ which together intersect all cycles which pass through the terminals. Subset-FVS generalizes several well studied prob…

    Submitted 8 January, 2019; originally announced January 2019.

  18. arXiv:1811.12640  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Inferring Concept Prerequisite Relations from Online Educational Resources

    Authors: Sudeshna Roy, Meghana Madhyastha, Sheril Lawrence, Vaibhav Rajan

    Abstract: The Internet has rich and rapidly increasing sources of high quality educational content. Inferring prerequisite relations between educational concepts is required for modern large-scale online educational technology applications such as personalized recommendations and automatic curriculum creation. We present PREREQ, a new supervised learning method for inferring concept prerequisite relations.…

    Submitted 22 January, 2019; v1 submitted 30 November, 2018; originally announced November 2018.

    Comments: Accepted at the AAAI Conference on Innovative Applications of Artificial Intelligence (IAAI-19)

  19. Deep Collective Matrix Factorization for Augmented Multi-View Learning

    Authors: Ragunathan Mariappan, Vaibhav Rajan

    Abstract: Learning by integrating multiple heterogeneous data sources is a common requirement in many tasks. Collective Matrix Factorization (CMF) is a technique to learn shared latent representations from arbitrary collections of matrices. It can be used to simultaneously complete one or more matrices, for predicting the unknown entries. Classical CMF methods assume linearity in the interaction of latent f…

    Submitted 15 April, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

  20. arXiv:1602.07280  [pdf, other]

    stat.AP cs.LG

    A Statistical Model for Stroke Outcome Prediction and Treatment Planning

    Authors: Abhishek Sengupta, Vaibhav Rajan, Sakyajit Bhattacharya, G R K Sarma

    Abstract: Stroke is a major cause of mortality and long-term disability in the world. Predictive outcome models in stroke are valuable for personalized treatment, rehabilitation planning and in controlled clinical trials. In this paper we design a new model to predict outcome in the short-term, the putative therapeutic window for several treatments. Our regression-based model has a parametric form that is…

    Submitted 22 February, 2016; originally announced February 2016.

  21. arXiv:1501.01894  [pdf]

    cs.CL

    Quantifying Scripts: Defining metrics of characters for quantitative and descriptive analysis

    Authors: Vinodh Rajan

    Abstract: Analysis of scripts plays an important role in paleography and in quantitative linguistics. Especially in the field of digital paleography quantitative features are much needed to differentiate glyphs. We describe an elaborate set of metrics that quantify qualitative information contained in characters and hence indirectly also quantify the scribal features. We broadly divide the metrics into seve…

    Submitted 8 January, 2015; originally announced January 2015.

    Comments: Manuscript submitted to Literary and Linguistic Computing Journal

  22. arXiv:1203.3519  [pdf]

    cs.LG cs.AI stat.ML

    Bayesian Inference in Monte-Carlo Tree Search

    Authors: Gerald Tesauro, V T Rajan, Richard Segal

    Abstract: Monte-Carlo Tree Search (MCTS) methods are drawing great interest after yielding breakthrough results in computer Go. This paper proposes a Bayesian approach to MCTS that is inspired by distribution-free approaches such as UCT [13], yet significantly differs in important respects. The Bayesian framework allows potentially much more accurate (Bayes-optimal) estimation of node values and node uncerta…

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-580-588