Showing 1–10 of 10 results for author: Balaji, P

  1. arXiv:2407.04272  [pdf, other]

    cs.LG cs.DC

    Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression

    Authors: Hao Feng, Boyuan Zhang, Fanjiang Ye, Min Si, Ching-Hsiang Chu, Jiannan Tian, Chunxing Yin, Summer Deng, Yuchen Hao, Pavan Balaji, Tong Geng, Dingwen Tao

    Abstract: DLRM is a state-of-the-art recommendation system model that has gained widespread adoption across various industry applications. The large size of DLRM models, however, necessitates the use of multiple devices/GPUs for efficient training. A significant bottleneck in this process is the time-consuming all-to-all communication required to collect embedding data from all devices. To mitigate this, we…

    Submitted 11 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: Accepted by SC '24

  2. UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture

    Authors: Sitian Chen, Haobin Tan, Amelie Chi Zhou, Yusen Li, Pavan Balaji

    Abstract: Deep Learning Recommendation Models (DLRMs) have gained popularity in recommendation systems due to their effectiveness in handling large-scale recommendation tasks. The embedding layers of DLRMs have become the performance bottleneck due to their intensive demands on memory capacity and memory bandwidth. In this paper, we propose UpDLRM, which utilizes real-world processing-in-memory (PIM) hardware,…

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.02411  [pdf, other]

    cs.CV

    Decoupling of neural network calibration measures

    Authors: Dominik Werner Wolf, Prasannavenkatesh Balaji, Alexander Braun, Markus Ulrich

    Abstract: A lot of effort is currently invested in safeguarding autonomous driving systems, which heavily rely on deep neural networks for computer vision. We investigate the coupling of different neural network calibration measures with a special focus on the Area Under the Sparsification Error curve (AUSE) metric. We elaborate on the well-known inconsistency in determining optimal calibration using the Ex…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Submitted to the German Conference on Pattern Recognition (GCPR) 2024

  4. arXiv:2311.16052  [pdf, other]

    cs.CV

    Exploring Attribute Variations in Style-based GANs using Diffusion Models

    Authors: Rishubh Parihar, Prasanna Balaji, Raghav Magazine, Sarthak Vora, Tejan Karmali, Varun Jampani, R. Venkatesh Babu

    Abstract: Existing attribute editing methods treat semantic attributes as binary, resulting in a single edit per attribute. However, attributes such as eyeglasses, smiles, or hairstyles exhibit a vast range of diversity. In this work, we formulate the task of diverse attribute editing by modeling the multidimensional nature of attribute edits. This enables users to generate multiple plausible edits…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: NeurIPS Workshop on Diffusion Models 2023

  5. arXiv:2310.20704  [pdf, other]

    cs.CV cs.AI

    Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders

    Authors: Srijan Das, Tanmay Jain, Dominick Reilly, Pranav Balaji, Soumyajit Karmakar, Shyam Marjit, Xiang Li, Abhijit Das, Michael S. Ryoo

    Abstract: Vision Transformers (ViTs) have become ubiquitous in computer vision. Despite their success, ViTs lack inductive biases, which can make it difficult to train them with limited data. To address this challenge, prior studies suggest training ViTs with self-supervised learning (SSL) and fine-tuning sequentially. However, we observe that jointly optimizing ViTs for the primary task and a Self-Supervis…

    Submitted 27 December, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted to WACV 2024

  6. arXiv:2308.13503  [pdf, other]

    cs.CV cs.AI

    Attending Generalizability in Course of Deep Fake Detection by Exploring Multi-task Learning

    Authors: Pranav Balaji, Abhijit Das, Srijan Das, Antitza Dantcheva

    Abstract: This work explores various multi-task learning (MTL) techniques for classifying videos as original or manipulated in a cross-manipulation scenario, with the aim of improving generalizability in deepfake detection. The dataset used in our evaluation is FaceForensics++, which features 1000 original videos manipulated by four different techniques, for a total of 5000 videos. We conduct extensive…

    Submitted 25 August, 2023; originally announced August 2023.

  7. How I Learned to Stop Worrying About User-Visible Endpoints and Love MPI

    Authors: Rohit Zambre, Aparna Chandramowlishwaran, Pavan Balaji

    Abstract: MPI+threads is gaining prominence as an alternative to the traditional MPI everywhere model in order to better handle the disproportionate increase in the number of cores compared with other on-node resources. However, the communication performance of MPI+threads can be 100x slower than that of MPI everywhere. Both MPI users and developers are to blame for this slowdown. Typically, MPI users do no…

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: In Proceedings of the 34th ACM International Conference on Supercomputing (ICS), Barcelona, Spain, June 2020

    ACM Class: C.2.4

  8. arXiv:2002.07677  [pdf]

    cs.SD cs.MM eess.AS

    Performance Analysis of Adaptive Noise Cancellation for Speech Signal

    Authors: Pratibha Balaji, Shruthi Narayan, Durga Sraddha, Bharath K P, Karthik R, Rajesh Kumar Muthu

    Abstract: This paper gives broad insight into the application of adaptive filters in noise cancellation during various processes in which a signal is transmitted. Adaptive filtering techniques such as RLS, LMS, and normalized LMS are used to filter the input signal, using the concept of negative feedback to predict its nature and remove it effectively from the input. In this paper a comparative study between the eff…

    Submitted 3 February, 2020; originally announced February 2020.

  9. arXiv:2002.07590  [pdf]

    eess.AS cs.IR cs.SD

    Speech Emotion Recognition using Support Vector Machine

    Authors: Manas Jain, Shruthi Narayan, Pratibha Balaji, Bharath K P, Abhijit Bhowmick, Karthik R, Rajesh Kumar Muthu

    Abstract: In this project, we aim to classify speech into one of four emotions, namely sadness, anger, fear, and happiness. The samples used in this project are taken from the Linguistic Data Consortium (LDC) and the UGA database. The important characteristics determined from the samples are energy, pitch, MFCC coefficients, LPCC coefficients and speaker rate. The classifier used…

    Submitted 3 February, 2020; originally announced February 2020.

  10. Scalable Communication Endpoints for MPI+Threads Applications

    Authors: Rohit Zambre, Aparna Chandramowlishwaran, Pavan Balaji

    Abstract: Hybrid MPI+threads programming is gaining prominence as an alternative to the traditional "MPI everywhere" model to better handle the disproportionate increase in the number of cores compared with other on-node resources. Current implementations of these two models represent the two extreme cases of communication resource sharing in modern MPI implementations. In the MPI-everywhere model, each MP…

    Submitted 6 February, 2020; originally announced February 2020.

    Comments: In Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems (ICPADS), Sentosa, Singapore, December 2018. Best Poster Award

    Journal ref: In 2018 IEEE 24th International Conference on Parallel and Distributed Systems (ICPADS), pp. 803-812. IEEE, 2018