Skip to main content

Showing 1–50 of 80 results for author: Saha, P

  1. arXiv:2407.06226  [pdf

    quant-ph cs.LG

    Quantum Machine Learning with Application to Progressive Supranuclear Palsy Network Classification

    Authors: Papri Saha

    Abstract: Machine learning and quantum computing are being progressively explored to shed light on possible computational approaches to deal with hitherto unsolvable problems. Classical methods for machine learning are ubiquitous in pattern recognition, with support vector machines (SVMs) being a prominent technique for network classification. However, there are limitations to the successful resolution of s… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  2. arXiv:2406.19543  [pdf, other

    cs.CL cs.SI

    Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management

    Authors: Seid Muhie Yimam, Daryna Dementieva, Tim Fischer, Daniil Moskovskiy, Naquee Rizwan, Punyajoy Saha, Sarthak Roy, Martin Semmann, Alexander Panchenko, Chris Biemann, Animesh Mukherjee

    Abstract: Despite regulations imposed by nations and social media platforms, such as recent EU regulations targeting digital violence, abusive content persists as a significant challenge. Existing approaches primarily rely on binary solutions, such as outright blocking or banning, yet fail to address the complex nature of abusive speech. In this work, we propose a more comprehensive approach called Demarcat… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2406.12911  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    The Promise of Analog Deep Learning: Recent Advances, Challenges and Opportunities

    Authors: Aditya Datar, Pramit Saha

    Abstract: Much of the present-day Artificial Intelligence (AI) utilizes artificial neural networks, which are sophisticated computational models designed to recognize patterns and solve complex problems by learning from data. However, a major bottleneck occurs during a device's calculation of weighted sums for forward propagation and optimization procedure for backpropagation, especially for deep neural net… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.11636  [pdf, other

    eess.IV cs.CV cs.LG

    Feasibility of Federated Learning from Client Databases with Different Brain Diseases and MRI Modalities

    Authors: Felix Wagner, Wentian Xu, Pramit Saha, Ziyun Liang, Daniel Whitehouse, David Menon, Natalie Voets, J. Alison Noble, Konstantinos Kamnitsas

    Abstract: Segmentation models for brain lesions in MRI are commonly developed for a specific disease and trained on data with a predefined set of MRI modalities. Each such model cannot segment the disease using data with a different set of MRI modalities, nor can it segment any other type of disease. Moreover, this training paradigm does not allow a model to benefit from learning from heterogeneous database… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    ACM Class: I.4.9; I.4.6; I.2.11; I.4.0

  5. arXiv:2406.06703  [pdf, other

    cs.CV cs.LG

    Video-based Exercise Classification and Activated Muscle Group Prediction with Hybrid X3D-SlowFast Network

    Authors: Manvik Pasula, Pramit Saha

    Abstract: This paper introduces a simple yet effective strategy for exercise classification and muscle group activation prediction (MGAP). These tasks have significant implications for personal fitness, facilitating more affordable, accessible, safer, and simpler exercise routines. This is particularly relevant for novices and individuals with disabilities. Previous research in the field is mostly dominated… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 16 pages, 7 figures, submitted to IEEE Open Journal of the Computer Society

    ACM Class: I.2.10; I.4.8

  6. arXiv:2405.04023  [pdf, other

    eess.IV cs.CV

    Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using AI

    Authors: Rikathi Pal, Sudeshna Mondal, Aditi Gupta, Priya Saha, Somoballi Ghoshal, Amlan Chakrabarti, Susmita Sur-Kolay

    Abstract: In medical imaging, segmentation and localization of spinal tumors in three-dimensional (3D) space pose significant computational challenges, primarily stemming from limited data availability. In response, this study introduces a novel data augmentation technique, aimed at automating spine tumor segmentation and localization through AI approaches. Leveraging a fusion of fuzzy c-means clustering an… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 9 pages, 12 figures

  7. arXiv:2404.18291  [pdf, other

    cs.CV cs.AI

    Panoptic Segmentation and Labelling of Lumbar Spine Vertebrae using Modified Attention Unet

    Authors: Rikathi Pal, Priya Saha, Somoballi Ghoshal, Amlan Chakrabarti, Susmita Sur-Kolay

    Abstract: Segmentation and labeling of vertebrae in MRI images of the spine are critical for the diagnosis of illnesses and abnormalities. These steps are indispensable as MRI technology provides detailed information about the tissue structure of the spine. Both supervised and unsupervised segmentation methods exist, yet acquiring sufficient data remains challenging for achieving high accuracy. In this stud… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 9 pages, 10 figures

  8. arXiv:2404.04283  [pdf, other

    cs.CV cs.LG eess.IV

    Translation-based Video-to-Video Synthesis

    Authors: Pratim Saha, Chengcui Zhang

    Abstract: Translation-based Video Synthesis (TVS) has emerged as a vital research area in computer vision, aiming to facilitate the transformation of videos between distinct domains while preserving both temporal continuity and underlying content features. This technique has found wide-ranging applications, encompassing video super-resolution, colorization, segmentation, and more, by extending the capabilit… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 25 pages, 9 figures

  9. arXiv:2403.14938  [pdf, ps, other

    cs.CL

    On Zero-Shot Counterspeech Generation by LLMs

    Authors: Punyajoy Saha, Aalok Agrawal, Abhik Jana, Chris Biemann, Animesh Mukherjee

    Abstract: With the emergence of numerous Large Language Models (LLM), the usage of such models in various Natural Language Processing (NLP) applications is increasing extensively. Counterspeech generation is one such key task where efforts are made to develop generative models by fine-tuning LLMs with hatespeech - counterspeech pairs, but none of these attempts explores the intrinsic properties of large lan… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 12 pages, 7 tables, accepted at LREC-COLING 2024

  10. arXiv:2403.12161  [pdf

    cs.CE cs.CY q-fin.GN

    Effect of Leaders Voice on Financial Market: An Empirical Deep Learning Expedition on NASDAQ, NSE, and Beyond

    Authors: Arijit Das, Tanmoy Nandi, Prasanta Saha, Suman Das, Saronyo Mukherjee, Sudip Kumar Naskar, Diganta Saha

    Abstract: Financial market like the price of stock, share, gold, oil, mutual funds are affected by the news and posts on social media. In this work deep learning based models are proposed to predict the trend of financial market based on NLP analysis of the twitter handles of leaders of different fields. There are many models available to predict financial market based on only the historical data of the fin… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 20 pages original research

  11. arXiv:2402.14702  [pdf, other

    cs.CL

    InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks

    Authors: Somnath Banerjee, Maulindu Sarkar, Punyajoy Saha, Binny Mathew, Animesh Mukherjee

    Abstract: Recently, influence functions present an apparatus for achieving explainability for deep neural models by quantifying the perturbation of individual train instances that might impact a test prediction. Our objectives in this paper are twofold. First we incorporate influence functions as a feedback into the model to improve its performance. Second, in a dataset extension exercise, using influence f… ▽ More

    Submitted 9 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted at LREC-COLING 2024 (Long Paper)

  12. arXiv:2402.12198  [pdf, other

    cs.CL cs.CV cs.LG

    Zero shot VLMs for hate meme detection: Are we there yet?

    Authors: Naquee Rizwan, Paramananda Bhaskar, Mithun Das, Swadhin Satyaprakash Majhi, Punyajoy Saha, Animesh Mukherjee

    Abstract: Multimedia content on social media is rapidly evolving, with memes gaining prominence as a distinctive form. Unfortunately, some malicious users exploit memes to target individuals or vulnerable communities, making it imperative to identify and address such instances of hateful memes. Extensive research has been conducted to address this issue by developing hate meme detection models. However, a n… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  13. arXiv:2402.07262  [pdf, other

    cs.CL cs.HC

    Low-Resource Counterspeech Generation for Indic Languages: The Case of Bengali and Hindi

    Authors: Mithun Das, Saurabh Kumar Pandey, Shivansh Sethi, Punyajoy Saha, Animesh Mukherjee

    Abstract: With the rise of online abuse, the NLP community has begun investigating the use of neural architectures to generate counterspeech that can "counter" the vicious tone of such abusive speech and dilute/ameliorate their rippling effect over the social network. However, most of the efforts so far have been primarily focused on English. To bridge the gap for low-resource languages such as Bengali and… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: Accepted to the Findings of the ACL: EACL 2024

  14. arXiv:2402.05294  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Examining Modality Incongruity in Multimodal Federated Learning for Medical Vision and Language-based Disease Detection

    Authors: Pramit Saha, Divyanshu Mishra, Felix Wagner, Konstantinos Kamnitsas, J. Alison Noble

    Abstract: Multimodal Federated Learning (MMFL) utilizes multiple modalities in each client to build a more powerful Federated Learning (FL) model than its unimodal counterpart. However, the impact of missing modality in different clients, also called modality incongruity, has been greatly overlooked. This paper, for the first time, analyses the impact of modality incongruity and reveals its connection with… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 42 pages

  15. arXiv:2312.07571  [pdf, other

    cs.CV cs.AI cs.LG

    Investigating YOLO Models Towards Outdoor Obstacle Detection For Visually Impaired People

    Authors: Chenhao He, Pramit Saha

    Abstract: The utilization of deep learning-based object detection is an effective approach to assist visually impaired individuals in avoiding obstacles. In this paper, we implemented seven different YOLO object detection models \textit{viz}., YOLO-NAS (small, medium, large), YOLOv8, YOLOv7, YOLOv6, and YOLOv5 and performed comprehensive evaluation with carefully tuned hyperparameters, to analyze how these… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  16. arXiv:2312.05717  [pdf, other

    cs.LG cs.AI

    Forecasting Lithium-Ion Battery Longevity with Limited Data Availability: Benchmarking Different Machine Learning Algorithms

    Authors: Hudson Hilal, Pramit Saha

    Abstract: As the use of Lithium-ion batteries continues to grow, it becomes increasingly important to be able to predict their remaining useful life. This work aims to compare the relative performance of different machine learning algorithms, both traditional machine learning and deep learning, in order to determine the best-performing algorithms for battery cycle life prediction based on minimal data. We i… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  17. arXiv:2311.00514  [pdf, other

    cs.HC cs.IT

    How Hard Is Squash? -- Towards Information Theoretic Analysis of Motor Behavior in Squash

    Authors: Kavya Anand, Pramit Saha

    Abstract: Fitts' law has been widely employed as a research method for analyzing tasks within the domain of Human-Computer Interaction (HCI). However, its application to non-computer tasks has remained limited. This study aims to extend the application of Fitts' law to the realm of sports, specifically focusing on squash. Squash is a high-intensity sport that requires quick movements and precise shots. Our… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  18. arXiv:2311.00469  [pdf, other

    cs.CV cs.AI cs.LG

    Dual Conditioned Diffusion Models for Out-Of-Distribution Detection: Application to Fetal Ultrasound Videos

    Authors: Divyanshu Mishra, He Zhao, Pramit Saha, Aris T. Papageorghiou, J. Alison Noble

    Abstract: Out-of-distribution (OOD) detection is essential to improve the reliability of machine learning models by detecting samples that do not belong to the training distribution. Detecting OOD samples effectively in certain tasks can pose a challenge because of the substantial heterogeneity within the in-distribution (ID), and the high structural similarity between ID and OOD classes. For instance, when… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Published in MICCAI 2023

  19. arXiv:2310.18815  [pdf, other

    cs.LG cs.AI cs.CV

    Rethinking Semi-Supervised Federated Learning: How to co-train fully-labeled and fully-unlabeled client imaging data

    Authors: Pramit Saha, Divyanshu Mishra, J. Alison Noble

    Abstract: The most challenging, yet practical, setting of semi-supervised federated learning (SSFL) is where a few clients have fully labeled data whereas the other clients have fully unlabeled data. This is particularly common in healthcare settings where collaborating partners (typically hospitals) may have images but not annotations. The bottleneck in this setting is the joint training of labeled and unl… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Published in MICCAI 2023 with early acceptance and selected as 1 of the top 20 poster highlights under the category: Which work has the potential to impact other applications of AI and CV

  20. arXiv:2310.12860  [pdf, other

    cs.CL cs.CY

    Probing LLMs for hate speech detection: strengths and vulnerabilities

    Authors: Sarthak Roy, Ashish Harshavardhan, Animesh Mukherjee, Punyajoy Saha

    Abstract: Recently efforts have been made by social media platforms as well as researchers to detect hateful or toxic language using large language models. However, none of these works aim to use explanation, additional context and victim community information in the detection process. We utilise different prompt variation, input information and evaluate large language models in zero shot setting (without a… ▽ More

    Submitted 28 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 13 pages, 9 figures, 7 tables, accepted to findings of EMNLP 2023

  21. Using ChatGPT in HCI Research -- A Trioethnography

    Authors: Smit Desai, Tanusree Sharma, Pratyasha Saha

    Abstract: This paper explores the lived experience of using ChatGPT in HCI research through a month-long trioethnography. Our approach combines the expertise of three HCI researchers with diverse research interests to reflect on our daily experience of living and working with ChatGPT. Our findings are presented as three provocations grounded in our collective experiences and HCI theories. Specifically, we e… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  22. arXiv:2309.11646  [pdf, other

    cs.LG

    An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder

    Authors: Rownak Ara Rasul, Promy Saha, Diponkor Bala, S M Rakib Ul Karim, Md. Ibrahim Abdullah, Bishwajit Saha

    Abstract: Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify cru… ▽ More

    Submitted 28 December, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: 20 pages, 12 figures, 8 tables

  23. arXiv:2308.16735  [pdf, other

    cs.CV cs.AI

    Post-Deployment Adaptation with Access to Source Data via Federated Learning and Source-Target Remote Gradient Alignment

    Authors: Felix Wagner, Zeju Li, Pramit Saha, Konstantinos Kamnitsas

    Abstract: Deployment of Deep Neural Networks in medical imaging is hindered by distribution shift between training data and data processed after deployment, causing performance degradation. Post-Deployment Adaptation (PDA) addresses this by tailoring a pre-trained, deployed model to the target data distribution using limited labelled or entirely unlabelled target data, while assuming no access to source tra… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: This version was accepted for the Machine Learning in Medical Imaging (MLMI 2023) workshop at MICCAI 2023

  24. arXiv:2305.03915  [pdf, other

    cs.CV cs.CL cs.MM

    HateMM: A Multi-Modal Dataset for Hate Video Classification

    Authors: Mithun Das, Rohit Raj, Punyajoy Saha, Binny Mathew, Manish Gupta, Animesh Mukherjee

    Abstract: Hate speech has become one of the most significant issues in modern society, having implications in both the online and the offline world. Due to this, hate speech research has recently gained a lot of traction. However, most of the work has primarily focused on text media with relatively little work on images and even lesser on videos. Thus, early stage automated video moderation techniques are n… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted at ICWSM 2023(dataset track)

  25. arXiv:2303.10311  [pdf, other

    cs.SI cs.CL cs.CY

    On the rise of fear speech in online social media

    Authors: Punyajoy Saha, Kiran Garimella, Narla Komal Kalyan, Saurabh Kumar Pandey, Pauras Mangesh Meher, Binny Mathew, Animesh Mukherjee

    Abstract: Recently, social media platforms are heavily moderated to prevent the spread of online hate speech, which is usually fertile in toxic words and is directed toward an individual or a community. Owing to such heavy moderation, newer and more subtle techniques are being deployed. One of the most striking among these is fear speech. Fear speech, as the name suggests, attempts to incite fear about a ta… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 16 pages, 9 tables, 15 figures, accepted in Proceedings of the National Academy of Sciences of the United States of America

  26. HateProof: Are Hateful Meme Detection Systems really Robust?

    Authors: Piush Aggarwal, Pranit Chawla, Mithun Das, Punyajoy Saha, Binny Mathew, Torsten Zesch, Animesh Mukherjee

    Abstract: Exploiting social media to spread hate has tremendously increased over the years. Lately, multi-modal hateful content such as memes has drawn relatively more traction than uni-modal content. Moreover, the availability of implicit content payloads makes them fairly challenging to be detected by existing hateful meme detection systems. In this paper, we present a use case study to analyze such syste… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

    Comments: Accepted at TheWebConf'2023 (WWW'2023)

  27. arXiv:2211.17046  [pdf, other

    cs.CL cs.CY

    Rationale-Guided Few-Shot Classification to Detect Abusive Language

    Authors: Punyajoy Saha, Divyanshu Sheth, Kushal Kedia, Binny Mathew, Animesh Mukherjee

    Abstract: Abusive language is a concerning problem in online social media. Past research on detecting abusive language covers different platforms, languages, demographies, etc. However, models trained using these datasets do not perform well in cross-domain evaluation settings. To overcome this, a common strategy is to use a few samples from the target domain to train models to get better performance in tha… ▽ More

    Submitted 27 July, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: 11 pages, 14 tables, 3 figures, The code repository is https://github.com/punyajoy/RGFS_ECAI

  28. arXiv:2210.03479  [pdf, other

    cs.CL

    Hate Speech and Offensive Language Detection in Bengali

    Authors: Mithun Das, Somnath Banerjee, Punyajoy Saha, Animesh Mukherjee

    Abstract: Social media often serves as a breeding ground for various hateful and offensive content. Identifying such content on social media is crucial due to its impact on the race, gender, or religion in an unprejudiced society. However, while there is extensive research in hate speech detection in English, there is a gap in hateful content detection in low-resource languages like Bengali. Besides, a curr… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Accepted at AACL-IJCNLP 2022

  29. Road Rutting Detection using Deep Learning on Images

    Authors: Poonam Kumari Saha, Deeksha Arya, Ashutosh Kumar, Hiroya Maeda, Yoshihide Sekimoto

    Abstract: Road rutting is a severe road distress that can cause premature failure of road incurring early and costly maintenance costs. Research on road damage detection using image processing techniques and deep learning are being actively conducted in the past few years. However, these researches are mostly focused on detection of cracks, potholes, and their variants. Very few research has been done on th… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 9 pages, 7 figures

    ACM Class: E.0; J.0

    Journal ref: 2022 IEEE International Conference on Big Data (Big Data), Osaka, Japan, 2022, pp. 6507-6515

  30. Exploration of Parameter Spaces Assisted by Machine Learning

    Authors: A. Hammad, Myeonghun Park, Raymundo Ramos, Pankaj Saha

    Abstract: We demonstrate two sampling procedures assisted by machine learning models via regression and classification. The main objective is the use of a neural network to suggest points likely inside regions of interest, reducing the number of evaluations of time consuming calculations. We compare results from this approach with results from other sampling methods, namely Markov chain Monte Carlo and Mult… ▽ More

    Submitted 11 January, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: 25 pages, 8 figures. Added comparisons and more results for 2HDM. Code and instructions are available on https://github.com/AHamamd150/MLscanner

    Journal ref: Comput.Phys.Commun. 293 (2023) 108902

  31. arXiv:2206.13284  [pdf, other

    cs.CL

    Which one is more toxic? Findings from Jigsaw Rate Severity of Toxic Comments

    Authors: Millon Madhur Das, Punyajoy Saha, Mithun Das

    Abstract: The proliferation of online hate speech has necessitated the creation of algorithms which can detect toxicity. Most of the past research focuses on this detection as a classification task, but assigning an absolute toxicity label is often tricky. Hence, few of the past works transform the same task into a regression. This paper shows the comparative evaluation of different transformers and traditi… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  32. arXiv:2205.11367  [pdf, other

    cs.AI

    Rethinking Task-Incremental Learning Baselines

    Authors: Md Sazzad Hossain, Pritom Saha, Townim Faisal Chowdhury, Shafin Rahman, Fuad Rahman, Nabeel Mohammed

    Abstract: It is common to have continuous streams of new data that need to be introduced in the system in real-world applications. The model needs to learn newly added capabilities (future tasks) while retaining the old knowledge (past tasks). Incremental learning has recently become increasingly appealing for this problem. Task-incremental learning is a kind of incremental learning where task identity of n… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted in ICPR2022

  33. arXiv:2205.04304  [pdf, other

    cs.CL cs.CY

    CounterGeDi: A controllable approach to generate polite, detoxified and emotional counterspeech

    Authors: Punyajoy Saha, Kanishk Singh, Adarsh Kumar, Binny Mathew, Animesh Mukherjee

    Abstract: Recently, many studies have tried to create generation models to assist counter speakers by providing counterspeech suggestions for combating the explosive proliferation of online hate. However, since these suggestions are from a vanilla generation model, they might not include the appropriate properties required to counter a particular hate speech instance. In this paper, we propose CounterGeDi -… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepted at IJCAI-ECAI 2022, 10 pages, 2 figures, 11 tables, Code is available at https://github.com/hate-alert/CounterGEDI

  34. arXiv:2205.00364  [pdf, other

    cs.CV

    RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems

    Authors: Burhan A. Mudassar, Sho Ko, Maojingjing Li, Priyabrata Saha, Saibal Mukhopadhyay

    Abstract: Interactive autonomous applications require robustness of the perception engine to artifacts in unconstrained videos. In this paper, we examine the effect of camera motion on the task of action detection. We develop a novel ranking method to rank videos based on the degree of global camera motion. For the high ranking camera videos we show that the accuracy of action detection is decreased. We pro… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

  35. arXiv:2205.00328  [pdf

    cs.CL

    HateCheckHIn: Evaluating Hindi Hate Speech Detection Models

    Authors: Mithun Das, Punyajoy Saha, Binny Mathew, Animesh Mukherjee

    Abstract: Due to the sheer volume of online hate, the AI and NLP communities have started building models to detect such hateful content. Recently, multilingual hate is a major emerging challenge for automated detection where code-mixing or more than one language have been used for conversation in social media. Typically, hate speech detection models are evaluated by measuring their performance on the held-… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

    Comments: Accepted at: 13th Edition of its Language Resources and Evaluation Conference. arXiv admin note: text overlap with arXiv:2012.15606 by other authors

  36. arXiv:2203.08655  [pdf, other

    cs.LG cs.CE math.NA

    Unraveled Multilevel Transformation Networks for Predicting Sparsely-Observed Spatiotemporal Dynamics

    Authors: Priyabrata Saha, Saibal Mukhopadhyay

    Abstract: In this paper, we address the problem of predicting complex, nonlinear spatiotemporal dynamics when available data is recorded at irregularly-spaced sparse spatial locations. Most of the existing deep learning models for modeling spatiotemporal dynamics are either designed for data in a regular grid or struggle to uncover the spatial relations from sparse and irregularly-spaced data sites. We prop… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: 16 pages, 7 figures. This manuscript has been accepted for publication in Philosophical Transactions of the Royal Society A

  37. arXiv:2111.14830  [pdf, other

    cs.CL

    Abusive and Threatening Language Detection in Urdu using Boosting based and BERT based models: A Comparative Approach

    Authors: Mithun Das, Somnath Banerjee, Punyajoy Saha

    Abstract: Online hatred is a growing concern on many social media platforms. To address this issue, different social media platforms have introduced moderation policies for such content. They also employ moderators who can check the posts violating moderation policies and take appropriate action. Academicians in the abusive language research domain also perform various studies to detect such content better.… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: Accepted in FIRE'21 (Track Abusive and Threatening Language Detection Task in Urdu). arXiv admin note: text overlap with arXiv:2111.13974

  38. arXiv:2111.13974  [pdf, other

    cs.CL

    Exploring Transformer Based Models to Identify Hate Speech and Offensive Content in English and Indo-Aryan Languages

    Authors: Somnath Banerjee, Maulindu Sarkar, Nancy Agrawal, Punyajoy Saha, Mithun Das

    Abstract: Hate speech is considered to be one of the major issues currently plaguing online social media. Repeated and repetitive exposure to hate speech has been shown to create physiological effects on the target users. Thus, hate speech, in all its forms, should be addressed on these platforms in order to maintain good health. In this paper, we explored several Transformer based machine learning models f… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: Accepted in FIRE'21 (Track HASOC - English and Indo-Aryan Languages)

  39. arXiv:2108.00524  [pdf, other

    cs.SI cs.CL cs.LG

    You too Brutus! Trapping Hateful Users in Social Media: Challenges, Solutions & Insights

    Authors: Mithun Das, Punyajoy Saha, Ritam Dutt, Pawan Goyal, Animesh Mukherjee, Binny Mathew

    Abstract: Hate speech is regarded as one of the crucial issues plaguing the online social media. The current literature on hate speech detection leverages primarily the textual content to find hateful posts and subsequently identify hateful users. However, this methodology disregards the social connections between users. In this paper, we run a detailed exploration of the problem space and investigate an ar… ▽ More

    Submitted 1 August, 2021; originally announced August 2021.

    Comments: Extended Version of this paper has been accepted at ACM HT'21. Link to the Code: https://github.com/hate-alert/Hateful-users-detection

  40. arXiv:2102.10084  [pdf, other

    cs.CL cs.AI

    Hate-Alert@DravidianLangTech-EACL2021: Ensembling strategies for Transformer-based Offensive language Detection

    Authors: Debjoy Saha, Naman Paharia, Debajit Chakraborty, Punyajoy Saha, Animesh Mukherjee

    Abstract: Social media often acts as breeding grounds for different forms of offensive content. For low resource languages like Tamil, the situation is more complex due to the poor performance of multilingual or language-specific models and lack of proper benchmark datasets. Based on this shared task, Offensive Language Identification in Dravidian Languages at EACL 2021, we present an exhaustive exploration… ▽ More

    Submitted 19 February, 2021; originally announced February 2021.

    Comments: 6 pages, 1 figure, 3 tables, code available at https://github.com/Debjoy10/Hate-Alert-DravidianLangTech

  41. arXiv:2102.03870  [pdf, other

    cs.SI cs.AI cs.CL

    "Short is the Road that Leads from Fear to Hate": Fear Speech in Indian WhatsApp Groups

    Authors: Punyajoy Saha, Binny Mathew, Kiran Garimella, Animesh Mukherjee

    Abstract: WhatsApp is the most popular messaging app in the world. Due to its popularity, WhatsApp has become a powerful and cheap tool for political campaigning being widely used during the 2019 Indian general election, where it was used to connect to the voters on a large scale. Along with the campaigning, there have been reports that WhatsApp has also become a breeding ground for harmful speech against v… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

    Comments: 13 pages, 9 figures, 8 tables, Accepted at The Web Conference 2021, code and dataset public at https://github.com/punyajoy/Fear-Speech-analysis

  42. "Facebook Promotes More Harassment": Social Media Ecosystem, Skill and Marginalized Hijra Identity in Bangladesh

    Authors: Fayika Farhat Nova, Michael Ann Devito, Pratyasha Saha, Kazi Shohanur Rashid, Shashwata Roy Turzo, Sadia Afrin, Shion Guha

    Abstract: Social interaction across multiple online platforms is a challenge for gender and sexual minorities (GSM) due to the stigmatization they face, which increases the complexity of their self-presentation decisions. These online interactions and identity disclosures can be more complicated for GSM in non-Western contexts due to consequentially different audiences and perceived affordances by the users… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: 35 pages, 9 figures, CSCW 2021

  43. arXiv:2102.01640  [pdf, other

    cs.SD cs.CL eess.AS

    SPEAK WITH YOUR HANDS Using Continuous Hand Gestures to control Articulatory Speech Synthesizer

    Authors: Pramit Saha, Debasish Ray Mohapatra, Sidney Fels

    Abstract: This work presents our advancements in controlling an articulatory speech synthesis engine, \textit{viz.}, Pink Trombone, with hand gestures. Our interface translates continuous finger movements and wrist flexion into continuous speech using vocal tract area-function based articulatory speech synthesis. We use Cyberglove II with 18 sensors to capture the kinematic information of the wrist and the… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: 2 pages, 1 figure

  44. arXiv:2101.00454  [pdf, other

    cs.DL

    Mining the online infosphere: A survey

    Authors: Sayantan Adak, Souvic Chakraborty, Paramtia Das, Mithun Das, Abhisek Dash, Rima Hazra, Binny Mathew, Punyajoy Saha, Soumya Sarkar, Animesh Mukherjee

    Abstract: The evolution of AI-based system and applications had pervaded everyday life to make decisions that have momentous impact on individuals and society. With the staggering growth of online data, often termed as the Online Infosphere it has become paramount to monitor the infosphere to ensure social good as the AI-based decisions are severely dependent on it. The goal of this survey is to provide a c… ▽ More

    Submitted 2 January, 2021; originally announced January 2021.

    Comments: 29 pages

  45. arXiv:2012.10289  [pdf, other

    cs.CL cs.AI cs.SI

    HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

    Authors: Binny Mathew, Punyajoy Saha, Seid Muhie Yimam, Chris Biemann, Pawan Goyal, Animesh Mukherjee

    Abstract: Hate speech is a challenging issue plaguing the online social media. While better models for hate speech detection are continuously being developed, there is little research on the bias and interpretability aspects of hate speech. In this paper, we introduce HateXplain, the first benchmark hate speech dataset covering multiple aspects of the issue. Each post in our dataset is annotated from three… ▽ More

    Submitted 12 April, 2022; v1 submitted 18 December, 2020; originally announced December 2020.

    Comments: 12 pages, 7 figues, 8 tables. Accepted at AAAI 2021

  46. arXiv:2011.14965  [pdf, other

    stat.ML cs.LG math.AP math.NA

    A Deep Learning Approach for Predicting Spatiotemporal Dynamics From Sparsely Observed Data

    Authors: Priyabrata Saha, Saibal Mukhopadhyay

    Abstract: In this paper, we consider the problem of learning prediction models for spatiotemporal physical processes driven by unknown partial differential equations (PDEs). We propose a deep learning framework that learns the underlying dynamics and predicts its evolution using sparsely distributed data sites. Deep learning has shown promising results in modeling physical dynamics in recent years. However,… ▽ More

    Submitted 1 May, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: 11 pages, 10 figures; Accepted manuscript IEEE Access

    Journal ref: IEEE Access, vol. 9, pp. 64200-64210, 2021

  47. arXiv:2009.11782  [pdf, other

    eess.SY cs.LG cs.RO

    Neural Identification for Control

    Authors: Priyabrata Saha, Magnus Egerstedt, Saibal Mukhopadhyay

    Abstract: We present a new method for learning control law that stabilizes an unknown nonlinear dynamical system at an equilibrium point. We formulate a system identification task in a self-supervised learning setting that jointly learns a controller and corresponding stable closed-loop dynamics hypothesis. The input-output behavior of the unknown dynamical system under random control inputs is used as the… ▽ More

    Submitted 15 March, 2022; v1 submitted 24 September, 2020; originally announced September 2020.

    Comments: Copyright 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: IEEE Robotics and Automation Letters, vol. 6, no. 3, pp. 4648-4655, July 2021

  48. arXiv:2007.12841  [pdf, other

    cs.CY cs.HC

    Combating Misinformation in Bangladesh: Roles and Responsibilities as Perceived by Journalists, Fact-checkers, and Users

    Authors: Md Mahfuzul Haque, Mohammad Yousuf, Ahmed Shatil Alam, Pratyasha Saha, Syed Ishtiaque Ahmed, Naeemul Hassan

    Abstract: There has been a growing interest within CSCW community in understanding the characteristics of misinformation propagated through computational media, and the devising techniques to address the associated challenges. However, most work in this area has been concentrated on the cases in the western world leaving a major portion of this problem unaddressed that is situated in the Global South. This… ▽ More

    Submitted 27 August, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

  49. arXiv:2006.16367  [pdf, other

    eess.IV cs.LG cs.SD eess.AS stat.ML

    Ultra2Speech -- A Deep Learning Framework for Formant Frequency Estimation and Tracking from Ultrasound Tongue Images

    Authors: Pramit Saha, Yadong Liu, Bryan Gick, Sidney Fels

    Abstract: Thousands of individuals need surgical removal of their larynx due to critical diseases every year and therefore, require an alternative form of communication to articulate speech sounds after the loss of their voice box. This work addresses the articulatory-to-acoustic mapping problem based on ultrasound (US) tongue images for the development of a silent-speech interface (SSI) that can provide th… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: Accepted for publication in MICCAI 2020

  50. arXiv:2006.04205  [pdf, other

    cond-mat.str-el cond-mat.dis-nn cond-mat.mes-hall cs.LG

    Machine learning dynamics of phase separation in correlated electron magnets

    Authors: Puhan Zhang, Preetha Saha, Gia-Wei Chern

    Abstract: We demonstrate machine-learning enabled large-scale dynamical simulations of electronic phase separation in double-exchange system. This model, also known as the ferromagnetic Kondo lattice model, is believed to be relevant for the colossal magnetoresistance phenomenon. Real-space simulations of such inhomogeneous states with exchange forces computed from the electron Hamiltonian can be prohibitiv… ▽ More

    Submitted 7 June, 2020; originally announced June 2020.

    Comments: 6 pages, 4 figures