Skip to main content

Showing 1–50 of 98 results for author: Parthasarathy, S

  1. arXiv:2405.15645  [pdf, other

    cs.PF cs.DC

    An Online Probabilistic Distributed Tracing System

    Authors: M. Toslali, S. Qasim, S. Parthasarathy, F. A. Oliveira, H. Huang, G. Stringhini, Z. Liu, A. K. Coskun

    Abstract: Distributed tracing has become a fundamental tool for diagnosing performance issues in the cloud by recording causally ordered, end-to-end workflows of request executions. However, tracing in production workloads can introduce significant overheads due to the extensive instrumentation needed for identifying performance variations. This paper addresses the trade-off between the cost of tracing and… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2405.02228  [pdf, other

    cs.CL cs.AI cs.IR

    REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMs

    Authors: Deepa Tilwani, Yash Saxena, Ali Mohammadi, Edward Raff, Amit Sheth, Srinivasan Parthasarathy, Manas Gaur

    Abstract: Automatic citation generation for sentences in a document or report is paramount for intelligence analysts, cybersecurity, news agencies, and education personnel. In this research, we investigate whether large language models (LLMs) are capable of generating references based on two forms of sentence queries: (a) Direct Queries, LLMs are asked to provide author names of the given research article,… ▽ More

    Submitted 8 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: Work in progress

  3. arXiv:2405.01573  [pdf, other

    cs.SE cs.AI

    Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

    Authors: Ajinkya Deshpande, Anmol Agarwal, Shashank Shet, Arun Iyer, Aditya Kanade, Ramakrishna Bairi, Suresh Parthasarathy

    Abstract: LLMs have demonstrated significant potential in code generation tasks, achieving promising results at the function or statement level across various benchmarks. However, the complexities associated with creating code artifacts like classes, particularly within the context of real-world software repositories, remain underexplored. Prior research treats class-level generation as an isolated task, ne… ▽ More

    Submitted 5 June, 2024; v1 submitted 21 April, 2024; originally announced May 2024.

    Comments: Preprint with additional experiments

  4. arXiv:2404.00816  [pdf, other

    cs.LG cs.AI

    HeteroMILE: a Multi-Level Graph Representation Learning Framework for Heterogeneous Graphs

    Authors: Yue Zhang, Yuntian He, Saket Gurukar, Srinivasan Parthasarathy

    Abstract: Heterogeneous graphs are ubiquitous in real-world applications because they can represent various relationships between different types of entities. Therefore, learning embeddings in such graphs is a critical problem in graph machine learning. However, existing solutions for this problem fail to scale to large heterogeneous graphs due to their high computational complexity. To address this issue,… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  5. Grounding from an AI and Cognitive Science Lens

    Authors: Goonmeet Bajaj, Srinivasan Parthasarathy, Valerie L. Shalin, Amit Sheth

    Abstract: Grounding is a challenging problem, requiring a formal definition and different levels of abstraction. This article explores grounding from both cognitive science and machine learning perspectives. It identifies the subtleties of grounding, its significance for collaborative agents, and similarities and differences in grounding approaches in both communities. The article examines the potential of… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Journal ref: IEEE Intelligent Systems, 2024

  6. arXiv:2402.06190  [pdf, other

    cs.CV cs.LG

    Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain

    Authors: Amin Karimi Monsefi, Payam Karisani, Mengxi Zhou, Stacey Choi, Nathan Doble, Heng Ji, Srinivasan Parthasarathy, Rajiv Ramnath

    Abstract: Standard modern machine-learning-based imaging methods have faced challenges in medical applications due to the high cost of dataset construction and, thereby, the limited labeled training data available. Additionally, upon deployment, these methods are usually used to process a large volume of data on a daily basis, imposing a high maintenance cost on medical facilities. In this paper, we introdu… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  7. arXiv:2311.07742  [pdf, other

    cs.IR

    Modeling Sequences as Star Graphs to Address Over-smoothing in Self-attentive Sequential Recommendation

    Authors: Bo Peng, Ziqi Chen, Srinivasan Parthasarathy, Xia Ning

    Abstract: Self-attention (SA) mechanisms have been widely used in developing sequential recommendation (SR) methods, and demonstrated state-of-the-art performance. However, in this paper, we show that self-attentive SR methods substantially suffer from the over-smoothing issue that item embeddings within a sequence become increasingly similar across attention blocks. As widely demonstrated in the literature… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2209.07997

    Journal ref: ACM Transactions on Knowledge Discovery from Data (TKDD) 2024

  8. arXiv:2310.01612  [pdf, other

    cs.IR

    Towards Efficient and Effective Adaptation of Large Language Models for Sequential Recommendation

    Authors: Bo Peng, Ben Burns, Ziqi Chen, Srinivasan Parthasarathy, Xia Ning

    Abstract: In recent years, with large language models (LLMs) achieving state-of-the-art performance in context understanding, increasing efforts have been dedicated to developing LLM-enhanced sequential recommendation (SR) methods. Considering that most existing LLMs are not specifically optimized for recommendation tasks, adapting them for SR becomes a critical step in LLM-enhanced SR methods. Though numer… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  9. arXiv:2309.12938  [pdf, other

    cs.AI cs.SE

    Frustrated with Code Quality Issues? LLMs can Help!

    Authors: Nalin Wadhwa, Jui Pradhan, Atharv Sonwane, Surya Prakash Sahu, Nagarajan Natarajan, Aditya Kanade, Suresh Parthasarathy, Sriram Rajamani

    Abstract: As software projects progress, quality of code assumes paramount importance as it affects reliability, maintainability and security of software. For this reason, static analysis tools are used in developer workflows to flag code quality issues. However, developers need to spend extra efforts to revise their code to improve code quality based on the tool findings. In this work, we investigate the u… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  10. arXiv:2309.12499  [pdf, other

    cs.SE

    CodePlan: Repository-level Coding using LLMs and Planning

    Authors: Ramakrishna Bairi, Atharv Sonwane, Aditya Kanade, Vageesh D C, Arun Iyer, Suresh Parthasarathy, Sriram Rajamani, B. Ashok, Shashank Shet

    Abstract: Software engineering activities such as package migration, fixing errors reports from static analysis or testing, and adding type annotations or other specifications to a codebase, involve pervasively editing the entire repository of code. We formulate these activities as repository-level coding tasks. Recent tools like GitHub Copilot, which are powered by Large Language Models (LLMs), have succ… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  11. arXiv:2309.10195  [pdf, other

    cs.IR

    Multi-modality Meets Re-learning: Mitigating Negative Transfer in Sequential Recommendation

    Authors: Bo Peng, Srinivasan Parthasarathy, Xia Ning

    Abstract: Learning effective recommendation models from sparse user interactions represents a fundamental challenge in developing sequential recommendation methods. Recently, pre-training-based methods have been developed to tackle this challenge. Though promising, in this paper, we show that existing methods suffer from the notorious negative transfer issue, where the model adapted from the pre-trained mod… ▽ More

    Submitted 20 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  12. arXiv:2308.11890  [pdf, other

    cs.LG q-bio.BM

    Shape-conditioned 3D Molecule Generation via Equivariant Diffusion Models

    Authors: Ziqi Chen, Bo Peng, Srinivasan Parthasarathy, Xia Ning

    Abstract: Ligand-based drug design aims to identify novel drug candidates of similar shapes with known active molecules. In this paper, we formulated an in silico shape-conditioned molecule generation problem to generate 3D molecule structures conditioned on the shape of a given molecule. To address this problem, we developed a translation- and rotation-equivariant shape-guided generative model ShapeMol. Sh… ▽ More

    Submitted 16 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  13. arXiv:2307.12465  [pdf, other

    cs.SE

    StaticFixer: From Static Analysis to Static Repair

    Authors: Naman Jain, Shubham Gandhi, Atharv Sonwane, Aditya Kanade, Nagarajan Natarajan, Suresh Parthasarathy, Sriram Rajamani, Rahul Sharma

    Abstract: Static analysis tools are traditionally used to detect and flag programs that violate properties. We show that static analysis tools can also be used to perturb programs that satisfy a property to construct variants that violate the property. Using this insight we can construct paired data sets of unsafe-safe program pairs, and learn strategies to automatically repair property violations. We prese… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  14. arXiv:2306.14357  [pdf, other

    cs.LG cs.SI

    PolicyClusterGCN: Identifying Efficient Clusters for Training Graph Convolutional Networks

    Authors: Saket Gurukar, Shaileshh Bojja Venkatakrishnan, Balaraman Ravindran, Srinivasan Parthasarathy

    Abstract: Graph convolutional networks (GCNs) have achieved huge success in several machine learning (ML) tasks on graph-structured data. Recently, several sampling techniques have been proposed for the efficient training of GCNs and to improve the performance of GCNs on ML tasks. Specifically, the subgraph-based sampling approaches such as ClusterGCN and GraphSAINT have achieved state-of-the-art performanc… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

  15. arXiv:2305.17304  [pdf, other

    cs.CL

    External Language Model Integration for Factorized Neural Transducers

    Authors: Michael Levit, Sarangarajan Parthasarathy, Cem Aksoylar, Mohammad Sadegh Rasooli, Shuangyu Chang

    Abstract: We propose an adaptation method for factorized neural transducers (FNT) with external language models. We demonstrate that both neural and n-gram external LMs add significantly more value when linearly interpolated with predictor output compared to shallow fusion, thus confirming that FNT forces the predictor to act like regular language models. Further, we propose a method to integrate class-base… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  16. FairMILE: Towards an Efficient Framework for Fair Graph Representation Learning

    Authors: Yuntian He, Saket Gurukar, Srinivasan Parthasarathy

    Abstract: Graph representation learning models have demonstrated great capability in many real-world applications. Nevertheless, prior research indicates that these models can learn biased representations leading to discriminatory outcomes. A few works have been proposed to mitigate the bias in graph representations. However, most existing works require exceptional time and computing resources for training… ▽ More

    Submitted 17 October, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

  17. arXiv:2209.07997  [pdf, other

    cs.IR

    Recursive Attentive Methods with Reused Item Representations for Sequential Recommendation

    Authors: Bo Peng, Srinivasan Parthasarathy, Xia Ning

    Abstract: Sequential recommendation aims to recommend the next item of users' interest based on their historical interactions. Recently, the self-attention mechanism has been adapted for sequential recommendation, and demonstrated state-of-the-art performance. However, in this manuscript, we show that the self-attention-based sequential recommendation methods could suffer from the localization-deficit issue… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

  18. arXiv:2209.04041  [pdf, other

    cs.CL

    Multilingual Transformer Language Model for Speech Recognition in Low-resource Languages

    Authors: Li Miao, Jian Wu, Piyush Behre, Shuangyu Chang, Sarangarajan Parthasarathy

    Abstract: It is challenging to train and deploy Transformer LMs for hybrid speech recognition 2nd pass re-ranking in low-resource languages due to (1) data scarcity in low-resource languages, (2) expensive computing costs for training and refreshing 100+ monolingual models, and (3) hosting inefficiency considering sparse traffic. In this study, we present a new way to group multiple low-resource locales tog… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

  19. arXiv:2206.01875  [pdf, other

    cs.IR

    Prospective Preference Enhanced Mixed Attentive Model for Session-based Recommendation

    Authors: Bo Peng, Chang-Yu Tai, Srinivasan Parthasarathy, Xia Ning

    Abstract: Session-based recommendation aims to generate recommendations for the next item of users' interest based on a given session. In this manuscript, we develop prospective preference enhanced mixed attentive model (P2MAM) to generate session-based recommendations using two important factors: temporal patterns and estimates of users' prospective preferences. Unlike existing methods, P2MAM models the te… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: Under review by IEEE Transactions on Knowledge and Data Engineering (TKDE)

    Journal ref: Springer Data Mining and Knowledge Discovery (DMKD) 2024

  20. arXiv:2205.10666  [pdf, other

    cs.LG cs.SI

    MultiBiSage: A Web-Scale Recommendation System Using Multiple Bipartite Graphs at Pinterest

    Authors: Saket Gurukar, Nikil Pancha, Andrew Zhai, Eric Kim, Samson Hu, Srinivasan Parthasarathy, Charles Rosenberg, Jure Leskovec

    Abstract: Graph Convolutional Networks (GCN) can efficiently integrate graph structure and node features to learn high-quality node embeddings. These embeddings can then be used for several tasks such as recommendation and search. At Pinterest, we have developed and deployed PinSage, a data-efficient GCN that learns pin embeddings from the Pin-Board graph. The Pin-Board graph contains pin and board entities… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

  21. arXiv:2204.12716  [pdf, other

    cs.CL cs.AI

    UBERT: A Novel Language Model for Synonymy Prediction at Scale in the UMLS Metathesaurus

    Authors: Thilini Wijesiriwardene, Vinh Nguyen, Goonmeet Bajaj, Hong Yung Yip, Vishesh Javangula, Yuqing Mao, Kin Wah Fung, Srinivasan Parthasarathy, Amit P. Sheth, Olivier Bodenreider

    Abstract: The UMLS Metathesaurus integrates more than 200 biomedical source vocabularies. During the Metathesaurus construction process, synonymous terms are clustered into concepts by human editors, assisted by lexical similarity algorithms. This process is error-prone and time-consuming. Recently, a deep learning model (LexLM) has been developed for the UMLS Vocabulary Alignment (UVA) task. This work intr… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

  22. arXiv:2204.05021  [pdf, other

    cs.SE cs.IR cs.PL

    Landmarks and Regions: A Robust Approach to Data Extraction

    Authors: Suresh Parthasarathy, Lincy Pattanaik, Anirudh Khatry, Arun Iyer, Arjun Radhakrishna, Sriram Rajamani, Mohammad Raza

    Abstract: We propose a new approach to extracting data items or field values from semi-structured documents. Examples of such problems include extracting passenger name, departure time and departure airport from a travel itinerary, or extracting price of an item from a purchase receipt. Traditional approaches to data extraction use machine learning or program synthesis to process the whole document to extra… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: To be published at PLDI,2022

  23. FairEGM: Fair Link Prediction and Recommendation via Emulated Graph Modification

    Authors: Sean Current, Yuntian He, Saket Gurukar, Srinivasan Parthasarathy

    Abstract: As machine learning becomes more widely adopted across domains, it is critical that researchers and ML engineers think about the inherent biases in the data that may be perpetuated by the model. Recently, many studies have shown that such biases are also imbibed in Graph Neural Network (GNN) models if the input graph is biased, potentially to the disadvantage of underserved and underrepresented co… ▽ More

    Submitted 20 October, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: 14 pages, 3 figures, 6 tables

    Journal ref: EAAMO (2022)

  24. arXiv:2112.02969  [pdf, other

    cs.SE cs.PL

    Jigsaw: Large Language Models meet Program Synthesis

    Authors: Naman Jain, Skanda Vaidyanath, Arun Iyer, Nagarajan Natarajan, Suresh Parthasarathy, Sriram Rajamani, Rahul Sharma

    Abstract: Large pre-trained language models such as GPT-3, Codex, and Google's language model are now capable of generating code from natural language specifications of programmer intent. We view these developments with a mixture of optimism and caution. On the optimistic side, such large language models have the potential to improve productivity by providing an automated AI pair programmer for every progra… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted to ICSE'22

  25. arXiv:2110.02038  [pdf, other

    cs.LG

    Semi-Supervised Deep Learning for Multiplex Networks

    Authors: Anasua Mitra, Priyesh Vijayan, Ranbir Sanasam, Diganta Goswami, Srinivasan Parthasarathy, Balaraman Ravindran

    Abstract: Multiplex networks are complex graph structures in which a set of entities are connected to each other via multiple types of relations, each relation representing a distinct layer. Such graphs are used to investigate many complex biological, social, and technological systems. In this work, we present a novel semi-supervised approach for structure-aware representation learning on multiplex networks… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

  26. arXiv:2110.01500  [pdf, other

    cs.CL eess.AS

    Factorized Neural Transducer for Efficient Language Model Adaptation

    Authors: Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li

    Abstract: In recent years, end-to-end (E2E) based automatic speech recognition (ASR) systems have achieved great success due to their simplicity and promising performance. Neural Transducer based models are increasingly popular in streaming E2E based ASR systems and have been reported to outperform the traditional hybrid system in some scenarios. However, the joint optimization of acoustic model, lexicon an… ▽ More

    Submitted 18 October, 2021; v1 submitted 27 September, 2021; originally announced October 2021.

  27. arXiv:2109.14638  [pdf, other

    cs.CL cs.CY cs.IR

    Privacy Policy Question Answering Assistant: A Query-Guided Extractive Summarization Approach

    Authors: Moniba Keymanesh, Micha Elsner, Srinivasan Parthasarathy

    Abstract: Existing work on making privacy policies accessible has explored new presentation forms such as color-coding based on the risk factors or summarization to assist users with conscious agreement. To facilitate a more personalized interaction with the policies, in this work, we propose an automated privacy policy question answering assistant that extracts a summary in response to the input user query… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: 10 pages , 1 figure

  28. arXiv:2109.13348  [pdf, other

    cs.CL

    Evaluating Biomedical BERT Models for Vocabulary Alignment at Scale in the UMLS Metathesaurus

    Authors: Goonmeet Bajaj, Vinh Nguyen, Thilini Wijesiriwardene, Hong Yung Yip, Vishesh Javangula, Srinivasan Parthasarathy, Amit Sheth, Olivier Bodenreider

    Abstract: The current UMLS (Unified Medical Language System) Metathesaurus construction process for integrating over 200 biomedical source vocabularies is expensive and error-prone as it relies on the lexical algorithms and human editors for deciding if the two biomedical terms are synonymous. Recent advances in Natural Language Processing such as Transformer models like BERT and its biomedical variants wit… ▽ More

    Submitted 15 October, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

  29. arXiv:2107.06243  [pdf, other

    cs.AI cs.CL cs.CY

    Fairness-aware Summarization for Justified Decision-Making

    Authors: Moniba Keymanesh, Tanya Berger-Wolf, Micha Elsner, Srinivasan Parthasarathy

    Abstract: In consequential domains such as recidivism prediction, facility inspection, and benefit assignment, it's important for individuals to know the decision-relevant information for the model's prediction. In addition, predictions should be fair both in terms of the outcome and the justification of the outcome. In other words, decision-relevant features should provide sufficient information for the pr… ▽ More

    Submitted 9 February, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: 22 pages, 9 figures

  30. arXiv:2104.00764  [pdf, other

    cs.CL

    SYSML: StYlometry with Structure and Multitask Learning: Implications for Darknet Forum Migrant Analysis

    Authors: Pranav Maneriker, Yuntian He, Srinivasan Parthasarathy

    Abstract: Darknet market forums are frequently used to exchange illegal goods and services between parties who use encryption to conceal their identities. The Tor network is used to host these markets, which guarantees additional anonymization from IP and location tracking, making it challenging to link across malicious users using multiple accounts (sybils). Additionally, users migrate to new forums when o… ▽ More

    Submitted 20 September, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 camera ready version

  31. arXiv:2103.15764  [pdf, other

    cs.AI

    eDarkTrends: Harnessing Social Media Trends in Substance use disorders for Opioid Listings on Cryptomarket

    Authors: Usha Lokala, Francois Lamy, Triyasha Ghosh Dastidar, Kaushik Roy, Raminta Daniulaityte, Srinivasan Parthasarathy, Amit Sheth

    Abstract: Opioid and substance misuse is rampant in the United States today, with the phenomenon known as the opioid crisis. The relationship between substance use and mental health has been extensively studied, with one possible relationship being substance misuse causes poor mental health. However, the lack of evidence on the relationship has resulted in opioids being largely inaccessible through legal me… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: 6 pages, ICLR AI for Public Health Workshop 2021

    ACM Class: I.2.7

  32. arXiv:2102.06269  [pdf, other

    eess.IV cs.SD eess.AS

    Disentanglement for audio-visual emotion recognition using multitask setup

    Authors: Raghuveer Peri, Srinivas Parthasarathy, Charles Bradshaw, Shiva Sundaram

    Abstract: Deep learning models trained on audio-visual data have been successfully used to achieve state-of-the-art performance for emotion recognition. In particular, models trained with multitask learning have shown additional performance improvements. However, such multitask models entangle information between the tasks, encoding the mutual dependencies present in label distributions in the real world da… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted for ICASSP 2021, 5 pages

  33. arXiv:2102.05843  [pdf, other

    cs.CV cs.LG

    Driving Style Representation in Convolutional Recurrent Neural Network Model of Driver Identification

    Authors: Sobhan Moosavi, Pravar D. Mahajan, Srinivasan Parthasarathy, Colleen Saunders-Chukwu, Rajiv Ramnath

    Abstract: Identifying driving styles is the task of analyzing the behavior of drivers in order to capture variations that will serve to discriminate different drivers from each other. This task has become a prerequisite for a variety of applications, including usage-based insurance, driver coaching, driver action prediction, and even in designing autonomous vehicles; because driving style encodes essential… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: 12 pages, research on driving style representation

  34. arXiv:2102.01380  [pdf, ps, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition

    Authors: Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong

    Abstract: The efficacy of external language model (LM) integration with existing end-to-end (E2E) automatic speech recognition (ASR) systems can be improved significantly using the internal language model estimation (ILME) method. In this method, the internal LM score is subtracted from the score obtained by interpolating the E2E score with the external LM score, during inference. To improve the ILME-based… ▽ More

    Submitted 22 April, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: 5 pages, ICASSP 2021

    Journal ref: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Canada

  35. A Tight Bound for Stochastic Submodular Cover

    Authors: Lisa Hellerstein, Devorah Kletenik, Srinivasan Parthasarathy

    Abstract: We show that the Adaptive Greedy algorithm of Golovin and Krause (2011) achieves an approximation bound of $(\ln (Q/η)+1)$ for Stochastic Submodular Cover: here $Q$ is the "goal value" and $η$ is the smallest non-zero marginal increase in utility deliverable by an item. (For integer-valued utility functions, we show a bound of $H(Q)$, where $H(Q)$ is the $Q^{th}$ Harmonic number.) Although this bo… ▽ More

    Submitted 2 August, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

    Comments: This work extends the result of Srinivasan Parthasarathy in his paper arXiv:1803.07639 from the problem of Stochastic Set Cover to that of Stochastic Submodular Cover

    Journal ref: Journal of Artificial Intelligence Research 71(2021) 347 - 370

  36. arXiv:2012.04231  [pdf, other

    cs.LG cs.NE stat.ML

    A Deep Generative Model for Molecule Optimization via One Fragment Modification

    Authors: Ziqi Chen, Martin Renqiang Min, Srinivasan Parthasarathy, Xia Ning

    Abstract: Molecule optimization is a critical step in drug development to improve desired properties of drug candidates through chemical modification. We developed a novel deep generative model Modof over molecular graphs for molecule optimization. Modof modifies a given molecule through the prediction of a single site of disconnection at the molecule and the removal and/or addition of fragments at that sit… ▽ More

    Submitted 13 January, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: This paper has been accepted by Nature Machine Intelligence

    Journal ref: Nat Mach Intell. 3 (2021) 1040-1049

  37. arXiv:2012.00063  [pdf, other

    eess.AS cs.SD eess.IV

    Detecting expressions with multimodal transformers

    Authors: Srinivas Parthasarathy, Shiva Sundaram

    Abstract: Developing machine learning algorithms to understand person-to-person engagement can result in natural user experiences for communal devices such as Amazon Alexa. Among other cues such as voice activity and gaze, a person's audio-visual expression that includes tone of the voice and facial expression serves as an implicit signal of engagement between parties in a dialog. This study investigates de… ▽ More

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: IEEE Spoken Language Technology Workshop 2021

  38. Self-Supervised learning with cross-modal transformers for emotion recognition

    Authors: Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram

    Abstract: Emotion recognition is a challenging task due to limited availability of in-the-wild labeled datasets. Self-supervised learning has shown improvements on tasks with limited labeled datasets in domains like speech and natural language. Models such as BERT learn to incorporate context in word embeddings, which translates to improved performance in downstream tasks like question answering. In this wo… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: To appear in SLT2020

  39. arXiv:2011.01991  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition

    Authors: Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong

    Abstract: The external language models (LM) integration remains a challenging task for end-to-end (E2E) automatic speech recognition (ASR) which has no clear division between acoustic and language models. In this work, we propose an internal LM estimation (ILME) method to facilitate a more effective integration of the external LM with all pre-existing E2E models with no additional model training, including… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 8 pages, 2 figures, SLT 2021

    Journal ref: 2021 IEEE Spoken Language Technology Workshop (SLT)

  40. arXiv:2010.11349  [pdf, ps, other

    cs.CL

    LSTM-LM with Long-Term History for First-Pass Decoding in Conversational Speech Recognition

    Authors: Xie Chen, Sarangarajan Parthasarathy, William Gale, Shuangyu Chang, Michael Zeng

    Abstract: LSTM language models (LSTM-LMs) have been proven to be powerful and yielded significant performance improvements over count based n-gram LMs in modern speech recognition systems. Due to its infinite history states and computational load, most previous studies focus on applying LSTM-LMs in the second-pass for rescoring purpose. Recent work shows that it is feasible and computationally affordable to… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: 5 pages

  41. arXiv:2010.00803  [pdf, ps, other

    cs.IT eess.SP

    Ranging success probability of PPP distributed automotive radar in presence of generalized fading

    Authors: Sudharsan Parthasarathy, Rakshith Jagannath

    Abstract: In automotive radar applications, multiple radars are used in all vehicles for improving the imaging quality. However this causes radar-to-radar interference from neighbouring vehicles, thus reducing the imaging quality. One metric to measure the imaging quality is ranging success probability. The ranging success probability is the probability that a multiple radar system successfully detects an o… ▽ More

    Submitted 2 October, 2020; originally announced October 2020.

  42. arXiv:2010.00734  [pdf, other

    eess.AS cs.SD eess.IV

    Training Strategies to Handle Missing Modalities for Audio-Visual Expression Recognition

    Authors: Srinivas Parthasarathy, Shiva Sundaram

    Abstract: Automatic audio-visual expression recognition can play an important role in communication services such as tele-health, VOIP calls and human-machine interaction. Accuracy of audio-visual expression recognition could benefit from the interplay between the two modalities. However, most audio-visual expression recognition systems, trained in ideal conditions, fail to generalize in real world scenario… ▽ More

    Submitted 30 November, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: ICMI 2020 workshop on "MODELING SOCIO-EMOTIONAL AND COGNITIVE PROCESSES FROM MULTIMODAL DATA IN THE WILD"

  43. Multi-modal embeddings using multi-task learning for emotion recognition

    Authors: Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram

    Abstract: General embeddings like word2vec, GloVe and ELMo have shown a lot of success in natural language tasks. The embeddings are typically extracted from models that are built on general tasks such as skip-gram models and natural language generation. In this paper, we extend the work from natural language understanding to multi-modal architectures that use audio, visual and textual information for machi… ▽ More

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: To appear in Interspeech,2020

  44. arXiv:2007.15188  [pdf, other

    eess.AS cs.CL cs.SD

    Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability

    Authors: Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong

    Abstract: Because of its streaming nature, recurrent neural network transducer (RNN-T) is a very promising end-to-end (E2E) model that may replace the popular hybrid model for automatic speech recognition. In this paper, we describe our recent development of RNN-T models with reduced GPU memory consumption during training, better initialization strategy, and advanced encoder modeling with future lookahead.… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: Accepted by Interspeech 2020

  45. arXiv:2007.02384  [pdf, other

    cs.DB

    DrugDBEmbed : Semantic Queries on Relational Database using Supervised Column Encodings

    Authors: Bortik Bandyopadhyay, Pranav Maneriker, Vedang Patel, Saumya Yashmohini Sahai, Ping Zhang, Srinivasan Parthasarathy

    Abstract: Traditional relational databases contain a lot of latent semantic information that have largely remained untapped due to the difficulty involved in automatically extracting such information. Recent works have proposed unsupervised machine learning approaches to extract such hidden information by textifying the database columns and then projecting the text tokens onto a fixed dimensional semantic v… ▽ More

    Submitted 5 July, 2020; originally announced July 2020.

  46. HPRA: Hyperedge Prediction using Resource Allocation

    Authors: Tarun Kumar, K Darwin, Srinivasan Parthasarathy, Balaraman Ravindran

    Abstract: Many real-world systems involve higher-order interactions and thus demand complex models such as hypergraphs. For instance, a research article could have multiple collaborating authors, and therefore the co-authorship network is best represented as a hypergraph. In this work, we focus on the problem of hyperedge prediction. This problem has immense applications in multiple domains, such as predict… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: Accepted at WebSci'20

  47. arXiv:2004.14840  [pdf, other

    eess.AS cs.CV cs.LG cs.SD stat.ML

    Multiresolution and Multimodal Speech Recognition with Transformers

    Authors: Georgios Paraskevopoulos, Srinivas Parthasarathy, Aparna Khare, Shiva Sundaram

    Abstract: This paper presents an audio visual automatic speech recognition (AV-ASR) system using a Transformer-based architecture. We particularly focus on the scene context provided by the visual information, to ground the ASR. We extract representations for audio features in the encoder layers of the transformer and fuse video features using an additional crossmodal multihead attention layer. Additionally… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: Accepted for ACL 2020

  48. arXiv:2004.03755  [pdf, other

    cs.CL cs.AI cs.CV

    Understanding Knowledge Gaps in Visual Question Answering: Implications for Gap Identification and Testing

    Authors: Goonmeet Bajaj, Bortik Bandyopadhyay, Daniel Schmidt, Pranav Maneriker, Christopher Myers, Srinivasan Parthasarathy

    Abstract: Visual Question Answering (VQA) systems are tasked with answering natural language questions corresponding to a presented image. Traditional VQA datasets typically contain questions related to the spatial information of objects, object attributes, or general scene questions. Recently, researchers have recognized the need to improve the balance of such datasets to reduce the system's dependency on… ▽ More

    Submitted 3 June, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

  49. arXiv:2004.01646  [pdf, other

    cs.LG cs.IR stat.ML

    M2: Mixed Models with Preferences, Popularities and Transitions for Next-Basket Recommendation

    Authors: Bo Peng, Zhiyun Ren, Srinivasan Parthasarathy, Xia Ning

    Abstract: Next-basket recommendation considers the problem of recommending a set of items into the next basket that users will purchase as a whole. In this paper, we develop a novel mixed model with preferences, popularities and transitions (M2) for the next-basket recommendation. This method models three important factors in next-basket generation process: 1) users' general preferences, 2) items' global po… ▽ More

    Submitted 17 January, 2022; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: Accepted by IEEE Transactions on Knowledge and Data Engineering (TKDE)

  50. arXiv:2002.11890  [pdf, other

    cs.IR cs.LG

    HAM: Hybrid Associations Models for Sequential Recommendation

    Authors: Bo Peng, Zhiyun Ren, Srinivasan Parthasarathy, Xia Ning

    Abstract: Sequential recommendation aims to identify and recommend the next few items for a user that the user is most likely to purchase/review, given the user's purchase/rating trajectories. It becomes an effective tool to help users select favorite items from a variety of options. In this manuscript, we developed hybrid associations models (HAM) to generate sequential recommendations using three factors:… ▽ More

    Submitted 4 January, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: This paper has been accepted by IEEE Transactions on Knowledge and Data Engineering (TKDE)