Skip to main content

Showing 1–50 of 93 results for author: Yadav, S

  1. arXiv:2407.04130  [pdf, other

    cs.CL

    Towards Automating Text Annotation: A Case Study on Semantic Proximity Annotation using GPT-4

    Authors: Sachin Yadav, Tejaswi Choppa, Dominik Schlechtweg

    Abstract: This paper explores using GPT-3.5 and GPT-4 to automate the data annotation process with automatic prompting techniques. The main aim of this paper is to reuse human annotation guidelines along with some annotated data to design automatic prompts for LLMs, focusing on the semantic proximity annotation task. Automatic prompts are compared to customized prompts. We further implement the prompting st… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 12 pages

  2. arXiv:2406.19238  [pdf, other

    cs.CL cs.CY cs.LG

    Revealing Fine-Grained Values and Opinions in Large Language Models

    Authors: Dustin Wright, Arnav Arora, Nadav Borenstein, Srishti Yadav, Serge Belongie, Isabelle Augenstein

    Abstract: Uncovering latent values and opinions in large language models (LLMs) can help identify biases and mitigate potential harm. Recently, this has been approached by presenting LLMs with survey questions and quantifying their stances towards morally and politically charged statements. However, the stances generated by LLMs can vary greatly depending on how they are prompted, and there are many ways to… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 28 pages, 20 figures, 7 tables

  3. arXiv:2406.10166  [pdf, other

    cs.LG

    Misam: Using ML in Dataflow Selection of Sparse-Sparse Matrix Multiplication

    Authors: Sanjali Yadav, Bahar Asgari

    Abstract: Sparse matrix-matrix multiplication (SpGEMM) is a critical operation in numerous fields, including scientific computing, graph analytics, and deep learning. These applications exploit the sparsity of matrices to reduce storage and computational demands. However, the irregular structure of sparse matrices poses significant challenges for performance optimization. Traditional hardware accelerators a… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted to ISCA 2024 MLArchSys workshop https://openreview.net/forum?id=A1V9FaZRbV

  4. arXiv:2406.08881  [pdf, other

    cs.CL

    No perspective, no perception!! Perspective-aware Healthcare Answer Summarization

    Authors: Gauri Naik, Sharad Chandakacherla, Shweta Yadav, Md. Shad Akhtar

    Abstract: Healthcare Community Question Answering (CQA) forums offer an accessible platform for individuals seeking information on various healthcare-related topics. People find such platforms suitable for self-disclosure, seeking medical opinions, finding simplified explanations for their medical conditions, and answering others' questions. However, answers on these forums are typically diverse and prone t… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Findings

  5. arXiv:2406.02178  [pdf, other

    cs.SD cs.AI eess.AS

    Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations

    Authors: Sarthak Yadav, Zheng-Hua Tan

    Abstract: Despite its widespread adoption as the prominent neural architecture, the Transformer has spurred several independent lines of work to address its limitations. One such approach is selective state space models, which have demonstrated promising results for language modelling. However, their feasibility for learning self-supervised, general-purpose audio representations is yet to be investigated. T… ▽ More

    Submitted 7 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted at INTERSPEECH 2024

  6. arXiv:2406.00247  [pdf, other

    cs.IR cs.AI

    Large Language Models for Relevance Judgment in Product Search

    Authors: Navid Mehrdad, Hrushikesh Mohapatra, Mossaab Bagdouri, Prijith Chandran, Alessandro Magnani, Xunfan Cai, Ajit Puthenputhussery, Sachin Yadav, Tony Lee, ChengXiang Zhai, Ciya Liao

    Abstract: High relevance of retrieved and re-ranked items to the search query is the cornerstone of successful product search, yet measuring relevance of items to queries is one of the most challenging tasks in product information retrieval, and quality of product search is highly influenced by the precision and scale of available relevance-labelled data. In this paper, we present an array of techniques for… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 10 pages, 1 figure, 11 tables - SIGIR 2024, LLM4Eval

    ACM Class: H.3.3; I.2.7

  7. arXiv:2405.06295  [pdf, other

    cs.CL cs.AI

    Aspect-oriented Consumer Health Answer Summarization

    Authors: Rochana Chaturvedi, Abari Bhattacharya, Shweta Yadav

    Abstract: Community Question-Answering (CQA) forums have revolutionized how people seek information, especially those related to their healthcare needs, placing their trust in the collective wisdom of the public. However, there can be several answers in response to a single query, which makes it hard to grasp the key information related to the specific health concern. Typically, CQA forums feature a single… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    ACM Class: H.4.3; I.2.7; J.3; J.7; K.6.4

  8. arXiv:2405.01587  [pdf

    cs.CL cs.AI cs.CV cs.LG

    Improve Academic Query Resolution through BERT-based Question Extraction from Images

    Authors: Nidhi Kamal, Saurabh Yadav, Jorawar Singh, Aditi Avasthi

    Abstract: Providing fast and accurate resolution to the student's query is an essential solution provided by Edtech organizations. This is generally provided with a chat-bot like interface to enable students to ask their doubts easily. One preferred format for student queries is images, as it allows students to capture and post questions without typing complex equations and information. However, this format… ▽ More

    Submitted 28 April, 2024; originally announced May 2024.

    Journal ref: 2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI) volume 2 (2024) 1-4

  9. arXiv:2404.17105  [pdf, other

    cs.CV

    Synthesizing Iris Images using Generative Adversarial Networks: Survey and Comparative Analysis

    Authors: Shivangi Yadav, Arun Ross

    Abstract: Biometric systems based on iris recognition are currently being used in border control applications and mobile devices. However, research in iris recognition is stymied by various factors such as limited datasets of bonafide irides and presentation attack instruments; restricted intra-class variations; and privacy concerns. Some of these issues can be mitigated by the use of synthetic iris data. I… ▽ More

    Submitted 11 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  10. arXiv:2404.14219  [pdf, other

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra , et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset… ▽ More

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  11. arXiv:2404.10989  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    FairSSD: Understanding Bias in Synthetic Speech Detectors

    Authors: Amit Kumar Singh Yadav, Kratika Bhagtani, Davide Salvi, Paolo Bestagini, Edward J. Delp

    Abstract: Methods that can generate synthetic speech which is perceptually indistinguishable from speech recorded by a human speaker, are easily available. Several incidents report misuse of synthetic speech generated from these methods to commit fraud. To counter such misuse, many methods have been proposed to detect synthetic speech. Some of these detectors are more interpretable, can generalize to detect… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024 (WMF)

  12. arXiv:2404.08888  [pdf, other

    cs.CL cs.LG

    Towards Enhancing Health Coaching Dialogue in Low-Resource Settings

    Authors: Yue Zhou, Barbara Di Eugenio, Brian Ziebart, Lisa Sharp, Bing Liu, Ben Gerber, Nikolaos Agadakos, Shweta Yadav

    Abstract: Health coaching helps patients identify and accomplish lifestyle-related goals, effectively improving the control of chronic diseases and mitigating mental health conditions. However, health coaching is cost-prohibitive due to its highly personalized and labor-intensive nature. In this paper, we propose to build a dialogue system that converses with the patients, helps them create and accomplish s… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted to the main conference of COLING 2022

  13. arXiv:2403.15484  [pdf, other

    cs.CL cs.LG

    RakutenAI-7B: Extending Large Language Models for Japanese

    Authors: Rakuten Group, Aaron Levine, Connie Huang, Chenguang Wang, Eduardo Batista, Ewa Szymanska, Hongyi Ding, Hou Wei Chou, Jean-François Pessiot, Johanes Effendi, Justin Chiu, Kai Torben Ohlhus, Karan Chopra, Keiji Shinzato, Koji Murakami, Lee Xiong, Lei Chen, Maki Kubota, Maksim Tkachenko, Miroku Lee, Naoki Takahashi, Prathyusha Jwalapuram, Ryutaro Tatsushima, Saurabh Jain, Sunil Kumar Yadav , et al. (5 additional authors not shown)

    Abstract: We introduce RakutenAI-7B, a suite of Japanese-oriented large language models that achieve the best performance on the Japanese LM Harness benchmarks among the open 7B models. Along with the foundation model, we release instruction- and chat-tuned models, RakutenAI-7B-instruct and RakutenAI-7B-chat respectively, under the Apache 2.0 license.

    Submitted 21 March, 2024; originally announced March 2024.

  14. arXiv:2403.09709  [pdf, other

    cs.CL

    Exploratory Data Analysis on Code-mixed Misogynistic Comments

    Authors: Sargam Yadav, Abhishek Kaushik, Kevin McDaid

    Abstract: The problems of online hate speech and cyberbullying have significantly worsened since the increase in popularity of social media platforms such as YouTube and Twitter (X). Natural Language Processing (NLP) techniques have proven to provide a great advantage in automatic filtering such toxic content. Women are disproportionately more likely to be victims of online abuse. However, there appears to… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: This paper is accepted in the 16th ISDSI-Global Conference 2023 https://isdsi2023.iimranchi.ac.in/

  15. arXiv:2403.02121  [pdf, other

    cs.CL cs.AI

    Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models

    Authors: Sargam Yadav, Abhishek Kaushik, Kevin McDaid

    Abstract: The advent of Large Language Models (LLMs) has advanced the benchmark in various Natural Language Processing (NLP) tasks. However, large amounts of labelled training data are required to train LLMs. Furthermore, data annotation and training are computationally expensive and time-consuming. Zero and few-shot learning have recently emerged as viable options for labelling data using large pre-trained… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: This paper is accepted in the 16th ISDSI-Global Conference 2023 https://isdsi2023.iimranchi.ac.in

  16. arXiv:2402.14205  [pdf, other

    cs.SD cs.CV cs.LG eess.AS eess.SP

    Compression Robust Synthetic Speech Detection Using Patched Spectrogram Transformer

    Authors: Amit Kumar Singh Yadav, Ziyue Xiang, Kratika Bhagtani, Paolo Bestagini, Stefano Tubaro, Edward J. Delp

    Abstract: Many deep learning synthetic speech generation tools are readily available. The use of synthetic speech has caused financial fraud, impersonation of people, and misinformation to spread. For this reason forensic methods that can detect synthetic speech have been proposed. Existing methods often overfit on one dataset and their performance reduces substantially in practical scenarios such as detect… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted as long oral paper at ICMLA 2023

  17. arXiv:2402.10783  [pdf, other

    cs.DS cs.CC

    On Permutation Selectors and their Applications in Ad-Hoc Radio Networks Protocols

    Authors: Jordan Kuschner, Yugarshi Shashwat, Sarthak Yadav, Marek Chrobak

    Abstract: Selective families of sets, or selectors, are combinatorial tools used to "isolate" individual members of sets from some set family. Given a set $X$ and an element $x\in X$, to isolate $x$ from $X$, at least one of the sets in the selector must intersect $X$ on exactly $x$. We study (k,N)-permutation selectors which have the property that they can isolate each element of each $k$-element subset of… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 9 pages, 2 figures

  18. arXiv:2401.16227  [pdf, other

    cs.CV eess.IV

    A Volumetric Saliency Guided Image Summarization for RGB-D Indoor Scene Classification

    Authors: Preeti Meena, Himanshu Kumar, Sandeep Yadav

    Abstract: Image summary, an abridged version of the original visual content, can be used to represent the scene. Thus, tasks such as scene classification, identification, indexing, etc., can be performed efficiently using the unique summary. Saliency is the most commonly used technique for generating the relevant image summary. However, the definition of saliency is subjective in nature and depends upon the… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  19. arXiv:2311.09086  [pdf, other

    cs.CL cs.AI cs.SI

    The Uli Dataset: An Exercise in Experience Led Annotation of oGBV

    Authors: Arnav Arora, Maha Jinadoss, Cheshta Arora, Denny George, Brindaalakshmi, Haseena Dawood Khan, Kirti Rawat, Div, Ritash, Seema Mathur, Shivani Yadav, Shehla Rashid Shora, Rie Raut, Sumit Pawar, Apurva Paithane, Sonia, Vivek, Dharini Priscilla, Khairunnisha, Grace Banu, Ambika Tandon, Rishav Thakker, Rahul Dev Korra, Aatman Vaidya, Tarunima Prabhakar

    Abstract: Online gender based violence has grown concomitantly with adoption of the internet and social media. Its effects are worse in the Global majority where many users use social media in languages other than English. The scale and volume of conversations on the internet has necessitated the need for automated detection of hate speech, and more specifically gendered abuse. There is, however, a lack of… ▽ More

    Submitted 24 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  20. arXiv:2311.05628  [pdf

    cs.CY cs.SE

    A Design and Development of Rubrics System for Android Applications

    Authors: Kaustubh Kundu, Sushant Yadav, Tayyabbali Sayyad

    Abstract: Online grading systems have become extremely prevalent as majority of academic materials are in the process of being digitized, if not already done. In this paper, we present the concept of design and implementation of a mobile application for "Student Evaluation System", envisaged with the purpose of making the task of evaluation of students performance by faculty and graders facile. This applica… ▽ More

    Submitted 23 September, 2023; originally announced November 2023.

    Comments: American Journal of Engineering Research (AJER)

  21. arXiv:2311.02924  [pdf, ps, other

    cs.HC

    AttentioNet: Monitoring Student Attention Type in Learning with EEG-Based Measurement System

    Authors: Dhruv Verma, Sejal Bhalla, S. V. Sai Santosh, Saumya Yadav, Aman Parnami, Jainendra Shukla

    Abstract: Student attention is an indispensable input for uncovering their goals, intentions, and interests, which prove to be invaluable for a multitude of research areas, ranging from psychology to interactive systems. However, most existing methods to classify attention fail to model its complex nature. To bridge this gap, we propose AttentioNet, a novel Convolutional Neural Network-based approach that u… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 8 pages, 4 figures, Accepted in AFFECTIVE COMPUTING + INTELLIGENT INTERACTION Conference 2023

    ACM Class: I.2.6; K.3.1

  22. arXiv:2309.10359  [pdf, other

    cs.CL

    Prompt, Condition, and Generate: Classification of Unsupported Claims with In-Context Learning

    Authors: Peter Ebert Christensen, Srishti Yadav, Serge Belongie

    Abstract: Unsupported and unfalsifiable claims we encounter in our daily lives can influence our view of the world. Characterizing, summarizing, and -- more generally -- making sense of such claims, however, can be challenging. In this work, we focus on fine-grained debate topics and formulate a new task of distilling, from such claims, a countable set of narratives. We present a crowdsourced dataset of 12… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  23. arXiv:2309.09195  [pdf, other

    cs.LG cs.AI cs.CL cs.NE

    SplitEE: Early Exit in Deep Neural Networks with Split Computing

    Authors: Divya J. Bajpai, Vivek K. Trivedi, Sohan L. Yadav, Manjesh K. Hanawal

    Abstract: Deep Neural Networks (DNNs) have drawn attention because of their outstanding performance on various tasks. However, deploying full-fledged DNNs in resource-constrained devices (edge, mobile, IoT) is difficult due to their large size. To overcome the issue, various approaches are considered, like offloading part of the computation to the cloud for final inference (split computing) or performing th… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 10 pages, to appear in the proceeding AIMLSystems 2023

  24. arXiv:2306.15768  [pdf

    cs.CV

    An Efficient Deep Convolutional Neural Network Model For Yoga Pose Recognition Using Single Images

    Authors: Santosh Kumar Yadav, Apurv Shukla, Kamlesh Tiwari, Hari Mohan Pandey, Shaik Ali Akbar

    Abstract: Pose recognition deals with designing algorithms to locate human body joints in a 2D/3D space and run inference on the estimated joint locations for predicting the poses. Yoga poses consist of some very complex postures. It imposes various challenges on the computer vision algorithms like occlusion, inter-class similarity, intra-class variability, viewpoint complexity, etc. This paper presents YPo… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  25. arXiv:2306.15765  [pdf

    cs.CV

    A Novel Two Stream Decision Level Fusion of Vision and Inertial Sensors Data for Automatic Multimodal Human Activity Recognition System

    Authors: Santosh Kumar Yadav, Muhtashim Rafiqi, Egna Praneeth Gummana, Kamlesh Tiwari, Hari Mohan Pandey, Shaik Ali Akbara

    Abstract: This paper presents a novel multimodal human activity recognition system. It uses a two-stream decision level fusion of vision and inertial sensors. In the first stream, raw RGB frames are passed to a part affinity field-based pose estimation network to detect the keypoints of the user. These keypoints are then pre-processed and inputted in a sliding window fashion to a specially designed convolut… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  26. arXiv:2306.00561  [pdf, other

    cs.SD cs.AI eess.AS

    Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

    Authors: Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan

    Abstract: In this work, we propose a Multi-Window Masked Autoencoder (MW-MAE) fitted with a novel Multi-Window Multi-Head Attention (MW-MHA) module that facilitates the modelling of local-global interactions in every decoder transformer block through attention heads of several distinct local and global windows. Empirical results on ten downstream audio tasks show that MW-MAEs consistently outperform standar… ▽ More

    Submitted 1 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  27. arXiv:2305.12596  [pdf, other

    cs.CV

    iWarpGAN: Disentangling Identity and Style to Generate Synthetic Iris Images

    Authors: Shivangi Yadav, Arun Ross

    Abstract: Generative Adversarial Networks (GANs) have shown success in approximating complex distributions for synthetic image generation. However, current GAN-based methods for generating biometric images, such as iris, have certain limitations: (a) the synthetic images often closely resemble images in the training dataset; (b) the generated images lack diversity in terms of the number of unique identities… ▽ More

    Submitted 29 August, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

  28. arXiv:2304.03323  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    DSVAE: Interpretable Disentangled Representation for Synthetic Speech Detection

    Authors: Amit Kumar Singh Yadav, Kratika Bhagtani, Ziyue Xiang, Paolo Bestagini, Stefano Tubaro, Edward J. Delp

    Abstract: Tools to generate high quality synthetic speech signal that is perceptually indistinguishable from speech recorded from human speakers are easily available. Several approaches have been proposed for detecting synthetic speech. Many of these approaches use deep learning methods as a black box without providing reasoning for the decisions they make. This limits the interpretability of these approach… ▽ More

    Submitted 28 July, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  29. arXiv:2303.06309  [pdf

    cs.HC cs.LG

    Virtual Mouse And Assistant: A Technological Revolution Of Artificial Intelligence

    Authors: Jagbeer Singh, Yash Goel, Shubhi Jain, Shiva Yadav

    Abstract: The purpose of this paper is to enhance the performance of the virtual assistant. So, what exactly is a virtual assistant. Application software, often called virtual assistants, also known as AI assistants or digital assistants, is software that understands natural language voice commands and can perform tasks on your behalf. What does a virtual assistant do. Virtual assistants can complete practi… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

  30. arXiv:2303.02609  [pdf

    cs.CY cs.HC cs.SI

    Socialbots and the Challenges of Cyberspace Awareness

    Authors: Shashank Yadav

    Abstract: As security communities brace for the emerging social automation based threats, we examine the mechanisms of developing situation awareness in cyberspace and the governance issues that socialbots bring into this existing paradigm of cyber situation awareness. We point out that an organisation's situation awareness in cyberspace is a phenomena fundamentally distinct from the original conception of… ▽ More

    Submitted 30 May, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

  31. Learning Vision-based Robotic Manipulation Tasks Sequentially in Offline Reinforcement Learning Settings

    Authors: Sudhir Pratap Yadav, Rajendra Nagar, Suril V. Shah

    Abstract: With the rise of deep reinforcement learning (RL) methods, many complex robotic manipulation tasks are being solved. However, harnessing the full power of deep learning requires large datasets. Online-RL does not suit itself readily into this paradigm due to costly and time-taking agent environment interaction. Therefore recently, many offline-RL algorithms have been proposed to learn robotic task… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: 7 pages, 5 Figures

  32. arXiv:2212.03384  [pdf

    cs.CV

    DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition

    Authors: Santosh Kumar Yadav, Achleshwar Luthra, Esha Pahwa, Kamlesh Tiwari, Heena Rathore, Hari Mohan Pandey, Peter Corcoran

    Abstract: Human activity recognition (HAR) using drone-mounted cameras has attracted considerable interest from the computer vision research community in recent years. A robust and efficient HAR system has a pivotal role in fields like video surveillance, crowd behavior analysis, sports analysis, and human-computer interaction. What makes it challenging are the complex poses, understanding different viewpoi… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.05531

  33. Deep Gaussian Processes for Air Quality Inference

    Authors: Aadesh Desai, Eshan Gujarathi, Saagar Parikh, Sachin Yadav, Zeel Patel, Nipun Batra

    Abstract: Air pollution kills around 7 million people annually, and approximately 2.4 billion people are exposed to hazardous air pollution. Accurate, fine-grained air quality (AQ) monitoring is essential to control and reduce pollution. However, AQ station deployment is sparse, and thus air quality inference for unmonitored locations is crucial. Conventional interpolation methods fail to learn the complex… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: Accepted for publication at ACM India Joint International Conference on Data Science and Management of Data (CoDS-COMAD 2023)

  34. arXiv:2211.05531  [pdf

    cs.CV

    SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition

    Authors: Santosh Kumar Yadav, Esha Pahwa, Achleshwar Luthra, Kamlesh Tiwari, Hari Mohan Pandey, Peter Corcoran

    Abstract: Drone-camera based human activity recognition (HAR) has received significant attention from the computer vision research community in the past few years. A robust and efficient HAR system has a pivotal role in fields like video surveillance, crowd behavior analysis, sports analysis, and human-computer interaction. What makes it challenging are the complex poses, understanding different viewpoints,… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  35. arXiv:2208.02675  [pdf, other

    cs.AI math.OC stat.AP

    Development of fully intuitionistic fuzzy data envelopment analysis model with missing data: an application to Indian police sector

    Authors: Anjali Sonkariya, Awadh Pratap Singh, Shiv Prasad Yadav

    Abstract: Data Envelopment Analysis (DEA) is a technique used to measure the efficiency of decision-making units (DMUs). In order to measure the efficiency of DMUs, the essential requirement is input-output data. Data is usually collected by humans, machines, or both. Due to human/machine errors, there are chances of having some missing values or inaccuracy, such as vagueness/uncertainty/hesitation in the c… ▽ More

    Submitted 27 July, 2022; originally announced August 2022.

  36. arXiv:2206.10594  [pdf

    cs.SI

    How is Vaping Framed on Online Knowledge Dissemination Platforms?

    Authors: Keyu Chen, Yiwen Shi, Jun Luo, Joyce Jiang, Shweta Yadav, Munmun De Choudhury, Ashiqur R. KhudaBukhsh, Marzieh Babaeianjelodar, Frederick Altice, Navin Kumar

    Abstract: We analyze 1,888 articles and 1,119,453 vaping posts to study how vaping is framed across multiple knowledge dissemination platforms (Wikipedia, Quora, Medium, Reddit, Stack Exchange, wikiHow). We use various NLP techniques to understand these differences. For example, n-grams, emotion recognition, and question answering results indicate that Medium, Quora, and Stack Exchange are appropriate venue… ▽ More

    Submitted 22 July, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2206.07765, arXiv:2206.09024

  37. arXiv:2206.09024  [pdf

    cs.SI

    Partisan US News Media Representations of Syrian Refugees

    Authors: Keyu Chen, Marzieh Babaeianjelodar, Yiwen Shi, Kamila Janmohamed, Rupak Sarkar, Ingmar Weber, Thomas Davidson, Munmun De Choudhury, Jonathan Huang, Shweta Yadav, Ashique Khudabukhsh, Preslav Ivanov Nakov, Chris Bauch, Orestis Papakyriakopoulos, Kaveh Khoshnood, Navin Kumar

    Abstract: We investigate how representations of Syrian refugees (2011-2021) differ across US partisan news outlets. We analyze 47,388 articles from the online US media about Syrian refugees to detail differences in reporting between left- and right-leaning media. We use various NLP techniques to understand these differences. Our polarization and question answering results indicated that left-leaning media t… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  38. arXiv:2206.07765  [pdf

    cs.SI

    US News and Social Media Framing around Vaping

    Authors: Keyu Chen, Marzieh Babaeianjelodar, Yiwen Shi, Rohan Aanegola, Lam Yin Cheung, Preslav Ivanov Nakov, Shweta Yadav, Angus Bancroft, Ashiqur R. KhudaBukhsh, Munmun De Choudhury, Frederick L. Altice, Navin Kumar

    Abstract: In this paper, we investigate how vaping is framed differently (2008-2021) between US news and social media. We analyze 15,711 news articles and 1,231,379 Facebook posts about vaping to study the differences in framing between media varieties. We use word embeddings to provide two-dimensional visualizations of the semantic changes around vaping for news and for social media. We detail that news me… ▽ More

    Submitted 22 July, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

  39. arXiv:2206.06581  [pdf, other

    cs.CL

    CHQ-Summ: A Dataset for Consumer Healthcare Question Summarization

    Authors: Shweta Yadav, Deepak Gupta, Dina Demner-Fushman

    Abstract: The quest for seeking health information has swamped the web with consumers' health-related questions. Generally, consumers use overly descriptive and peripheral information to express their medical condition or other healthcare needs, contributing to the challenges of natural language understanding. One way to address this challenge is to summarize the questions and distill the key information of… ▽ More

    Submitted 15 June, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

  40. arXiv:2205.04830  [pdf

    cs.CY cs.SI

    Political Propagation of Social Botnets: Policy Consequences

    Authors: Shashank Yadav

    Abstract: The 2016 US election was a watershed event where an electoral intervention by an adversarial state made extensive use of networks of software robots and data driven communications which transformed the interference into a goal driven functionality of man-machine collaboration. Reviewing the debates post the debacle, we reflect upon the policy consequences of the use of Social Botnets and understan… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  41. arXiv:2205.02543  [pdf, other

    cs.CV

    OCR Synthetic Benchmark Dataset for Indic Languages

    Authors: Naresh Saini, Promodh Pinto, Aravinth Bheemaraj, Deepak Kumar, Dhiraj Daga, Saurabh Yadav, Srihari Nagaraj

    Abstract: We present the largest publicly available synthetic OCR benchmark dataset for Indic languages. The collection contains a total of 90k images and their ground truth for 23 Indic languages. OCR model validation in Indic languages require a good amount of diverse data to be processed in order to create a robust and reliable model. Generating such a huge amount of data would be difficult otherwise but… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

  42. arXiv:2204.12067  [pdf, other

    cs.CV cs.MM

    An Overview of Recent Work in Media Forensics: Methods and Threats

    Authors: Kratika Bhagtani, Amit Kumar Singh Yadav, Emily R. Bartusiak, Ziyue Xiang, Ruiting Shao, Sriram Baireddy, Edward J. Delp

    Abstract: In this paper, we review recent work in media forensics for digital images, video, audio (specifically speech), and documents. For each data modality, we discuss synthesis and manipulation techniques that can be used to create and modify digital media. We then review technological advancements for detecting and quantifying such manipulations. Finally, we consider open issues and suggest directions… ▽ More

    Submitted 12 May, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: This is a longer version of a paper accepted to the 2022 IEEE International Conference on Multimedia Information Processing and Retrieval entitled "An Overview of Recent Work in Multimedia Forensics"

  43. arXiv:2203.15519  [pdf, other

    cs.SD cs.LG eess.AS

    Learning neural audio features without supervision

    Authors: Sarthak Yadav, Neil Zeghidour

    Abstract: Deep audio classification, traditionally cast as training a deep neural network on top of mel-filterbanks in a supervised fashion, has recently benefited from two independent lines of work. The first one explores "learnable frontends", i.e., neural modules that produce a learnable time-frequency representation, to overcome limitations of fixed features. The second one uses self-supervised learning… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Submitted to Interspeech 2022

  44. arXiv:2112.07411  [pdf, ps, other

    cs.CR cs.IT

    INRU: A Quasigroup Based Lightweight Block Cipher

    Authors: Sharwan K. Tiwari, Ambrish Awasthi, Sucheta Chkrabarti, Sudha Yadav

    Abstract: In this paper, we propose a quasigroup based block cipher design. The round functions of the encryption and decryption algorithms use quasigroup based string transformations. We show the robustness of the design against the standard differential, linear and algebraic cryptanalytic attacks. We also provide detailed statistical analysis using NIST test suite in CBC, CFB, OFB, and CTR modes of operat… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  45. arXiv:2111.13365  [pdf

    cs.CY cs.AI cs.SI

    Machines & Influence: An Information Systems Lens

    Authors: Shashank Yadav

    Abstract: Policymakers face a broader challenge of how to view AI capabilities today and where does society stand in terms of those capabilities. This paper surveys AI capabilities and tackles this very issue, exploring it in context of political security in digitally networked societies. We extend the ideas of Information Management to better understand contemporary AI systems as part of a larger and more… ▽ More

    Submitted 19 February, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

  46. arXiv:2109.08044  [pdf, other

    eess.IV cs.CV

    Eformer: Edge Enhancement based Transformer for Medical Image Denoising

    Authors: Achleshwar Luthra, Harsh Sulakhe, Tanish Mittal, Abhishek Iyer, Santosh Yadav

    Abstract: In this work, we present Eformer - Edge enhancement based transformer, a novel architecture that builds an encoder-decoder network using transformer blocks for medical image denoising. Non-overlapping window-based self-attention is used in the transformer block that reduces computational requirements. This work further incorporates learnable Sobel-Feldman operators to enhance edges in the image an… ▽ More

    Submitted 9 November, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: Accepted in ICCVW'2021

  47. arXiv:2107.00596  [pdf, other

    cs.CL

    Multimodal Graph-based Transformer Framework for Biomedical Relation Extraction

    Authors: Sriram Pingali, Shweta Yadav, Pratik Dutta, Sriparna Saha

    Abstract: The recent advancement of pre-trained Transformer models has propelled the development of effective text mining models across various biomedical tasks. However, these models are primarily learned on the textual data and often lack the domain knowledge of the entities to capture the context beyond the sentence. In this study, we introduced a novel framework that enables the model to learn multi-omn… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: To appear in Findings of ACL 2021

  48. arXiv:2107.00176  [pdf, other

    cs.CL

    Reinforcement Learning for Abstractive Question Summarization with Question-aware Semantic Rewards

    Authors: Shweta Yadav, Deepak Gupta, Asma Ben Abacha, Dina Demner-Fushman

    Abstract: The growth of online consumer health questions has led to the necessity for reliable and accurate question answering systems. A recent study showed that manual summarization of consumer health questions brings significant improvement in retrieving relevant answers. However, the automatic summarization of long questions is a challenging task due to the lack of training data and the complexity of th… ▽ More

    Submitted 30 June, 2021; originally announced July 2021.

    Comments: To appear at ACL 2021

  49. arXiv:2106.07257  [pdf

    cs.HC cs.AI

    Communication is the universal solvent: atreya bot -- an interactive bot for chemical scientists

    Authors: Mahak Sharma, Abhishek Kaushik, Rajesh Kumar, Sushant Kumar Rai, Harshada Hanumant Desai, Sargam Yadav

    Abstract: Conversational agents are a recent trend in human-computer interaction, deployed in multidisciplinary applications to assist the users. In this paper, we introduce "Atreya", an interactive bot for chemistry enthusiasts, researchers, and students to study the ChEMBL database. Atreya is hosted by Telegram, a popular cloud-based instant messaging application. This user-friendly bot queries the ChEMBL… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: IFIP 9.4 2021 1st Virtual Conference Conference Theme: Resilient ICT4D May 25th 28th, 2021

  50. arXiv:2106.00219  [pdf, ps, other

    cs.CL

    Question-aware Transformer Models for Consumer Health Question Summarization

    Authors: Shweta Yadav, Deepak Gupta, Asma Ben Abacha, Dina Demner-Fushman

    Abstract: Searching for health information online is becoming customary for more and more consumers every day, which makes the need for efficient and reliable question answering systems more pressing. An important contributor to the success rates of these systems is their ability to fully understand the consumers' questions. However, these questions are frequently longer than needed and mention peripheral i… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.