Skip to main content

Showing 1–4 of 4 results for author: Pankajakshan, A

  1. arXiv:2404.13008  [pdf, other

    cs.SD eess.AS

    Enhancing Generalization in Audio Deepfake Detection: A Neural Collapse based Sampling and Training Approach

    Authors: Mohammed Yousif, Jonat John Mathew, Huzaifa Pallan, Agamjeet Singh Padda, Syed Daniyal Shah, Sara Adamski, Madhu Reddiboina, Arjun Pankajakshan

    Abstract: Generalization in audio deepfake detection presents a significant challenge, with models trained on specific datasets often struggling to detect deepfakes generated under varying conditions and unknown algorithms. While collectively training a model using diverse datasets can enhance its generalization ability, it comes with high computational costs. To address this, we propose a neural collapse-b… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  2. arXiv:2403.11778  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Towards the Development of a Real-Time Deepfake Audio Detection System in Communication Platforms

    Authors: Jonat John Mathew, Rakin Ahsan, Sae Furukawa, Jagdish Gautham Krishna Kumar, Huzaifa Pallan, Agamjeet Singh Padda, Sara Adamski, Madhu Reddiboina, Arjun Pankajakshan

    Abstract: Deepfake audio poses a rising threat in communication platforms, necessitating real-time detection for audio stream integrity. Unlike traditional non-real-time approaches, this study assesses the viability of employing static deepfake audio detection models in real-time communication platforms. An executable software is developed for cross-platform compatibility, enabling real-time execution. Two… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  3. arXiv:2005.06650  [pdf, other

    eess.AS cs.LG cs.SD

    Memory Controlled Sequential Self Attention for Sound Recognition

    Authors: Arjun Pankajakshan, Helen L. Bear, Vinod Subramanian, Emmanouil Benetos

    Abstract: In this paper we investigate the importance of the extent of memory in sequential self attention for sound recognition. We propose to use a memory controlled sequential self attention mechanism on top of a convolutional recurrent neural network (CRNN) model for polyphonic sound event detection (SED). Experiments on the URBAN-SED dataset demonstrate the impact of the extent of memory on sound recog… ▽ More

    Submitted 5 August, 2020; v1 submitted 13 May, 2020; originally announced May 2020.

    Comments: Accepted to INTERSPEECH 2020

  4. arXiv:1907.05122  [pdf, other

    eess.AS cs.SD

    Polyphonic Sound Event and Sound Activity Detection: A Multi-task approach

    Authors: Arjun Pankajakshan, Helen L. Bear, Emmanouil Benetos

    Abstract: Polyphonic Sound Event Detection (SED) in real-world recordings is a challenging task because of the dynamic polyphony level, intensity, and duration of sound events. Current polyphonic SED systems fail to model the temporal structure of sound events explicitly and instead attempt to look at which sound events are present at each audio frame. Consequently, the event-wise detection performance is m… ▽ More

    Submitted 1 August, 2019; v1 submitted 11 July, 2019; originally announced July 2019.

    Comments: Accepted to WASPAA 2019