Showing 1–11 of 11 results for author: Desai, J

  1. arXiv:2406.02592  [pdf, other]

    cs.LG cs.AI cs.CL

    LOLAMEME: Logic, Language, Memory, Mechanistic Framework

    Authors: Jay Desai, Xiaobo Guo, Srinivasan H. Sengamedu

    Abstract: The performance of Large Language Models has achieved superhuman breadth with unprecedented depth. At the same time, the language models are mostly black box models and the underlying mechanisms for performance have been evaluated using synthetic or mechanistic schemes. We extend current mechanistic schemes to incorporate Logic, memory, and nuances of Language such as latent structure. The propose…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: https://openreview.net/pdf?id=73dhbcXxtV

  2. arXiv:2405.18642  [pdf, other]

    cs.AI cs.CL

    JADS: A Framework for Self-supervised Joint Aspect Discovery and Summarization

    Authors: Xiaobo Guo, Jay Desai, Srinivasan H. Sengamedu

    Abstract: To generate summaries that include multiple aspects or topics for text documents, most approaches use clustering or topic modeling to group relevant sentences and then generate a summary for each group. These approaches struggle to optimize the summarization and clustering algorithms jointly. On the other hand, aspect-based summarization requires known aspects. Our solution integrates topic discov…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: preprint

  3. arXiv:2405.10925  [pdf]

    stat.ME cs.AI cs.LG

    High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates

    Authors: Janick Weberpals, Pamela A. Shaw, Kueiyu Joshua Lin, Richard Wyss, Joseph M Plasek, Li Zhou, Kerry Ngan, Thomas DeRamus, Sudha R. Raman, Bradley G. Hammill, Hana Lee, Sengwee Toh, John G. Connolly, Kimberly J. Dandreo, Fang Tian, Wei Liu, Jie Li, José J. Hernández-Muñoz, Sebastian Schneeweiss, Rishi J. Desai

    Abstract: Multiple imputation (MI) models can be improved by including auxiliary covariates (AC), but their performance in high-dimensional data is not well understood. We aimed to develop and compare high-dimensional MI (HDMI) approaches using structured and natural language processing (NLP)-derived AC in studies with partially observed confounders. We conducted a plasmode simulation study using data from…

    Submitted 17 May, 2024; originally announced May 2024.

  4. arXiv:2308.16325  [pdf]

    cs.CV

    Two-Stage Violence Detection Using ViTPose and Classification Models at Smart Airports

    Authors: İrem Üstek, Jay Desai, Iván López Torrecillas, Sofiane Abadou, Jinjie Wang, Quentin Fever, Sandhya Rani Kasthuri, Yang Xing, Weisi Guo, Antonios Tsourdos

    Abstract: This study introduces an innovative violence detection framework tailored to the unique requirements of smart airports, where prompt responses to violent situations are crucial. The proposed framework harnesses the power of ViTPose for human pose estimation. It employs a CNN-BiLSTM network to analyse spatial and temporal information within keypoints sequences, enabling the accurate classificatio…

    Submitted 30 August, 2023; originally announced August 2023.

  5. arXiv:2307.04804  [pdf, other]

    cs.CL cs.AI

    S2vNTM: Semi-supervised vMF Neural Topic Modeling

    Authors: Weijie Xu, Jay Desai, Srinivasan Sengamedu, Xiaoyu Jiang, Francis Iannacci

    Abstract: Language model based methods are powerful techniques for text classification. However, the models have several shortcomings. (1) It is difficult to integrate human knowledge such as keywords. (2) It needs a lot of resources to train the models. (3) It relied on large text data to pretrain. In this paper, we propose Semi-Supervised vMF Neural Topic Modeling (S2vNTM) to overcome these difficulties.…

    Submitted 8 February, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 17 pages, 9 figures, ICLR Workshop 2023. arXiv admin note: text overlap with arXiv:2307.01226

    MSC Class: 68T50; ACM Class: I.2.7

    Journal ref: ICLR Workshop 2023

  6. arXiv:2307.01878  [pdf, other]

    cs.CL cs.AI

    KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation

    Authors: Weijie Xu, Xiaoyu Jiang, Jay Desai, Bin Han, Fuqin Yan, Francis Iannacci

    Abstract: In text classification tasks, fine tuning pretrained language models like BERT and GPT-3 yields competitive accuracy; however, both methods require pretraining on large text datasets. In contrast, general topic modeling methods possess the advantage of analyzing documents to extract meaningful patterns of words without the need of pretraining. To leverage topic modeling's unsupervised insights ext…

    Submitted 11 February, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 12 pages, 4 figures, ICLR 2022 Workshop

    MSC Class: 68T50; ACM Class: I.2.6

    Journal ref: ICLR 2022 Workshop PML4DC

  7. arXiv:2212.12652  [pdf, other]

    cs.CL

    STRUDEL: Structured Dialogue Summarization for Dialogue Comprehension

    Authors: Borui Wang, Chengcheng Feng, Arjun Nair, Madelyn Mao, Jai Desai, Asli Celikyilmaz, Haoran Li, Yashar Mehdad, Dragomir Radev

    Abstract: Abstractive dialogue summarization has long been viewed as an important standalone task in natural language processing, but no previous work has explored the possibility of whether abstractive dialogue summarization can also be used as a means to boost an NLP system's performance on other important dialogue comprehension tasks. In this paper, we propose a novel type of dialogue summarization task…

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: EMNLP 2022

  8. arXiv:2204.01849  [pdf]

    cs.CL cs.IR cs.LG

    Automatic Text Summarization Methods: A Comprehensive Review

    Authors: Divakar Yadav, Jalpa Desai, Arun Kumar Yadav

    Abstract: One of the most pressing issues that have arisen due to the rapid growth of the Internet is known as information overloading. Simplifying the relevant information in the form of a summary will assist many people because the material on any topic is plentiful on the Internet. Manually summarising massive amounts of text is quite challenging for humans. So, it has increased the need for more complex…

    Submitted 3 March, 2022; originally announced April 2022.

    Comments: 20 pages, 7 figures and 4 tables

  9. arXiv:2203.03428  [pdf, other]

    cs.SD cs.LG eess.AS

    Attention-based Region of Interest (ROI) Detection for Speech Emotion Recognition

    Authors: Jay Desai, Houwei Cao, Ravi Shah

    Abstract: Automatic emotion recognition for real-life applications is a challenging task. Human emotion expressions are subtle, and can be conveyed by a combination of several emotions. In most existing emotion recognition studies, each audio utterance/video clip is labelled/classified in its entirety. However, utterance/clip-level labelling and classification can be too coarse to capture the subtle intra-utt…

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: Paper written in 2019

  10. CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning

    Authors: Xiangru Tang, Arjun Nair, Borui Wang, Bingyao Wang, Jai Desai, Aaron Wade, Haoran Li, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev

    Abstract: Factual inconsistencies in generated summaries severely limit the practical applications of abstractive dialogue summarization. Although significant progress has been achieved by using pre-trained models, substantial amounts of hallucinated content are found during the human evaluation. Pre-trained models are most commonly fine-tuned with cross-entropy loss for text summarization, which may not be…

    Submitted 9 July, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Journal ref: NAACL 2022

  11. A Fast Keypoint Based Hybrid Method for Copy Move Forgery Detection

    Authors: Sunil Kumar, J. V. Desai, Shaktidev Mukherjee

    Abstract: Copy move forgery detection in digital images has become a very popular research topic in the area of image forensics. Due to the availability of sophisticated image editing tools and ever increasing hardware capabilities, it has become an easy task to manipulate the digital images. Passive forgery detection techniques are more relevant as they can be applied without the prior information about th…

    Submitted 10 December, 2016; originally announced December 2016.