Skip to main content

Showing 1–50 of 74 results for author: Modi, A

  1. arXiv:2407.05887  [pdf, other

    cs.CL cs.AI cs.LG

    Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs

    Authors: Sanjeet Singh, Shreya Gupta, Niralee Gupta, Naimish Sharma, Lokesh Srivastava, Vibhu Agarwal, Ashutosh Modi

    Abstract: The consequences of a healthcare data breach can be devastating for the patients, providers, and payers. The average financial impact of a data breach in recent months has been estimated to be close to USD 10 million. This is especially significant for healthcare organizations in India that are managing rapid digitization while still establishing data governance procedures that align with the lett… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted at BioNLP Workshop at ACL 2024; 21 pages (9 pages main content)

  2. arXiv:2407.05404  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    iSign: A Benchmark for Indian Sign Language Processing

    Authors: Abhinav Joshi, Romit Mohanty, Mounika Kanakanti, Andesha Mangla, Sudeep Choudhary, Monali Barbate, Ashutosh Modi

    Abstract: Indian Sign Language has limited resources for developing machine learning and data-driven approaches for automated language processing. Though text/audio-based language processing techniques have shown colossal research interest and tremendous improvements in the last few years, Sign Languages still need to catch up due to the need for more resources. To bridge this gap, in this work, we propose… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024 Findings. 18 Pages (9 Pages + References + Appendix)

  3. arXiv:2407.05399  [pdf, other

    cs.CL cs.AI cs.LG

    IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning

    Authors: Abhinav Joshi, Shounak Paul, Akshat Sharma, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi

    Abstract: Legal systems worldwide are inundated with exponential growth in cases and documents. There is an imminent need to develop NLP and ML techniques for automatically processing and understanding legal documents to streamline the legal system. However, evaluating and comparing various NLP models designed specifically for the legal domain is challenging. This paper addresses this challenge by proposing… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024 Main Conference; 40 Pages (9 Pages + References + Appendix)

  4. arXiv:2406.07860  [pdf, other

    cs.CL cs.AI cs.LG

    BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain

    Authors: Rahul Kumar, Amar Raja Dibbu, Shrutendra Harsola, Vignesh Subrahmaniam, Ashutosh Modi

    Abstract: Several large-scale datasets (e.g., WikiSQL, Spider) for developing natural language interfaces to databases have recently been proposed. These datasets cover a wide breadth of domains but fall short on some essential domains, such as finance and accounting. Given that accounting databases are used worldwide, particularly by non-technical people, there is an imminent need to develop models that co… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted at NAACL 2024; 20 Pages (main + appendix)

  5. arXiv:2406.05828  [pdf, other

    cs.CV cs.AI eess.IV

    Multi-Stain Multi-Level Convolutional Network for Multi-Tissue Breast Cancer Image Segmentation

    Authors: Akash Modi, Sumit Kumar Jha, Purnendu Mishra, Rajiv Kumar, Kiran Aatre, Gursewak Singh, Shubham Mathur

    Abstract: Digital pathology and microscopy image analysis are widely employed in the segmentation of digitally scanned IHC slides, primarily to identify cancer and pinpoint regions of interest (ROI) indicative of tumor presence. However, current ROI segmentation models are either stain-specific or suffer from the issues of stain and scanner variance due to different staining protocols or modalities across m… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  6. arXiv:2404.04525  [pdf, other

    cs.CL cs.AI cs.LG

    IITK at SemEval-2024 Task 10: Who is the speaker? Improving Emotion Recognition and Flip Reasoning in Conversations via Speaker Embeddings

    Authors: Shubham Patel, Divyaksh Shukla, Ashutosh Modi

    Abstract: This paper presents our approach for the SemEval-2024 Task 10: Emotion Discovery and Reasoning its Flip in Conversations. For the Emotion Recognition in Conversations (ERC) task, we utilize a masked-memory network along with speaker participation. We propose a transformer-based speaker-centric model for the Emotion Flip Reasoning (EFR) task. We also introduce Probable Trigger Zone, a region of the… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at SemEval 2024, NAACL 2024; 10 Pages

  7. arXiv:2404.04520  [pdf, other

    cs.CL cs.AI cs.LG

    IITK at SemEval-2024 Task 4: Hierarchical Embeddings for Detection of Persuasion Techniques in Memes

    Authors: Shreenaga Chikoti, Shrey Mehta, Ashutosh Modi

    Abstract: Memes are one of the most popular types of content used in an online disinformation campaign. They are primarily effective on social media platforms since they can easily reach many users. Memes in a disinformation campaign achieve their goal of influencing the users through several rhetorical and psychological techniques, such as causal oversimplification, name-calling, and smear. The SemEval 202… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at SemEval 2024, NAACL 2024; 9 pages

  8. arXiv:2404.04513  [pdf, other

    cs.CL cs.AI cs.LG

    IITK at SemEval-2024 Task 1: Contrastive Learning and Autoencoders for Semantic Textual Relatedness in Multilingual Texts

    Authors: Udvas Basak, Rajarshi Dutta, Shivam Pandey, Ashutosh Modi

    Abstract: This paper describes our system developed for the SemEval-2024 Task 1: Semantic Textual Relatedness. The challenge is focused on automatically detecting the degree of relatedness between pairs of sentences for 14 languages including both high and low-resource Asian and African languages. Our team participated in two subtasks consisting of Track A: supervised and Track B: unsupervised. This paper f… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at SemEval 2024, NAACL 2024; 6 pages

  9. arXiv:2404.04510  [pdf, other

    cs.CL cs.AI cs.LG

    IITK at SemEval-2024 Task 2: Exploring the Capabilities of LLMs for Safe Biomedical Natural Language Inference for Clinical Trials

    Authors: Shreyasi Mandal, Ashutosh Modi

    Abstract: Large Language models (LLMs) have demonstrated state-of-the-art performance in various natural language processing (NLP) tasks across multiple domains, yet they are prone to shortcut learning and factual inconsistencies. This research investigates LLMs' robustness, consistency, and faithful reasoning when performing Natural Language Inference (NLI) on breast cancer Clinical Trial Reports (CTRs) in… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at SemEval 2024, NAACL 2024; 8 Pages

  10. arXiv:2403.15412  [pdf, other

    cs.CY cs.AI cs.CL

    Towards Measuring and Modeling "Culture" in LLMs: A Survey

    Authors: Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury

    Abstract: We present a survey of more than 90 recent papers that aim to study cultural representation and inclusion in large language models (LLMs). We observe that none of the studies explicitly define "culture, which is a complex, multifaceted concept; instead, they probe the models on some specially designed datasets which represent certain aspects of "culture". We call these aspects the proxies of cultu… ▽ More

    Submitted 19 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  11. arXiv:2311.17578  [pdf

    cs.CR

    Data Driven Approaches to Cybersecurity Governance for Board Decision-Making -- A Systematic Review

    Authors: Anita Modi, Ievgeniia Kuzminykh, Bogdan Ghita

    Abstract: Cybersecurity governance influences the quality of strategic decision-making to ensure cyber risks are managed effectively. Board of Directors are the decisions-makers held accountable for managing this risk; however, they lack adequate and efficient information necessary for making such decisions. In addition to the myriad of challenges they face, they are often insufficiently versed in the techn… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  12. arXiv:2310.18974  [pdf, other

    cs.CL cs.AI cs.LG

    EtiCor: Corpus for Analyzing LLMs for Etiquettes

    Authors: Ashutosh Dwivedi, Pradhyumna Lavania, Ashutosh Modi

    Abstract: Etiquettes are an essential ingredient of day-to-day interactions among people. Moreover, etiquettes are region-specific, and etiquettes in one region might contradict those in other regions. In this paper, we propose EtiCor, an Etiquettes Corpus, having texts about social norms from five different regions across the globe. The corpus provides a test bed for evaluating LLMs for knowledge and under… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023, Main Conference

  13. arXiv:2307.05440  [pdf, other

    cs.CL cs.AI cs.LG

    ISLTranslate: Dataset for Translating Indian Sign Language

    Authors: Abhinav Joshi, Susmit Agrawal, Ashutosh Modi

    Abstract: Sign languages are the primary means of communication for many hard-of-hearing people worldwide. Recently, to bridge the communication gap between the hard-of-hearing community and the rest of the population, several sign language translation datasets have been proposed to enable the development of statistical sign language translation systems. However, there is a dearth of sign language resources… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted at ACL 2023 Findings, 8 Pages

  14. arXiv:2307.05260  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    U-CREAT: Unsupervised Case Retrieval using Events extrAcTion

    Authors: Abhinav Joshi, Akshat Sharma, Sai Kiran Tanikella, Ashutosh Modi

    Abstract: The task of Prior Case Retrieval (PCR) in the legal domain is about automatically citing relevant (based on facts and precedence) prior legal cases in a given query case. To further promote research in PCR, in this paper, we propose a new large benchmark (in English) for the PCR task: IL-PCR (Indian Legal Prior Case Retrieval) corpus. Given the complex nature of case relevance and the long size of… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted at ACL 2023, 15 pages (12 main + 3 Appendix)

  15. arXiv:2307.03906  [pdf, other

    cs.CL cs.AI cs.LG cs.MA

    ScriptWorld: Text Based Environment For Learning Procedural Knowledge

    Authors: Abhinav Joshi, Areeb Ahmad, Umang Pandey, Ashutosh Modi

    Abstract: Text-based games provide a framework for developing natural language understanding and commonsense knowledge about the world in reinforcement learning based agents. Existing text-based environments often rely on fictional situations and characters to create a gaming framework and are far from real-world scenarios. In this paper, we introduce ScriptWorld: a text-based environment for teaching agent… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Accepted at IJCAI 2023, 26 Pages (7 main + 19 for appendix)

  16. arXiv:2305.08358  [pdf, other

    cs.CR cs.DC cs.LG

    Quadratic Functional Encryption for Secure Training in Vertical Federated Learning

    Authors: Shuangyi Chen, Anuja Modi, Shweta Agrawal, Ashish Khisti

    Abstract: Vertical federated learning (VFL) enables the collaborative training of machine learning (ML) models in settings where the data is distributed amongst multiple parties who wish to protect the privacy of their individual data. Notably, in VFL, the labels are available to a single party and the complete feature set is formed only when data from all parties is combined. Recently, Xu et al. proposed a… ▽ More

    Submitted 19 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted to ISIT 2023

  17. arXiv:2304.09548  [pdf, other

    cs.CL cs.AI cs.LG

    SemEval 2023 Task 6: LegalEval - Understanding Legal Texts

    Authors: Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi, Sai Kiran Tanikella, Shouvik Kumar Guha, Sachin Malhan, Vivek Raghavan

    Abstract: In populous countries, pending legal cases have been growing exponentially. There is a need for developing NLP-based techniques for processing and automatically understanding legal documents. To promote research in the area of Legal NLP we organized the shared task LegalEval - Understanding Legal Texts at SemEval 2023. LegalEval task has three sub-tasks: Task-A (Rhetorical Roles Labeling) is about… ▽ More

    Submitted 1 May, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 13 Pages (9 Pages + References), Accepted at SemEval 2023 at ACL 2023

  18. arXiv:2302.01061  [pdf

    cs.AI

    MLOps with enhanced performance control and observability

    Authors: Indradumna Banerjee, Dinesh Ghanta, Girish Nautiyal, Pradeep Sanchana, Prateek Katageri, Atin Modi

    Abstract: The explosion of data and its ever increasing complexity in the last few years, has made MLOps systems more prone to failure, and new tools need to be embedded in such systems to avoid such failure. In this demo, we will introduce crucial tools in the observability module of a MLOps system that target difficult issues like data drfit and model version control for optimum model selection. We believ… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: SECOND INTERNATIONAL CONFERENCE ON AI-ML SYSTEMS

  19. arXiv:2211.03742  [pdf, other

    cs.CL cs.AI cs.LG

    Multi-Task Learning Framework for Extracting Emotion Cause Span and Entailment in Conversations

    Authors: Ashwani Bhat, Ashutosh Modi

    Abstract: Predicting emotions expressed in text is a well-studied problem in the NLP community. Recently there has been active research in extracting the cause of an emotion expressed in text. Most of the previous work has done causal emotion entailment in documents. In this work, we propose neural models to extract emotion cause span and entailment in conversations. For learning such models, we use RECCON… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 19 Pages, Accepted at Workshop on Transfer Learning for Natural Language Processing, NeurIPS 2022

  20. arXiv:2211.03587  [pdf, other

    cs.CV cs.AI cs.LG

    Generalized Product-of-Experts for Learning Multimodal Representations in Noisy Environments

    Authors: Abhinav Joshi, Naman Gupta, Jinang Shah, Binod Bhattarai, Ashutosh Modi, Danail Stoyanov

    Abstract: A real-world application or setting involves interaction between different modalities (e.g., video, speech, text). In order to process the multimodal information automatically and use it for an end application, Multimodal Representation Learning (MRL) has emerged as an active area of research in recent times. MRL involves learning reliable and robust representations of information from heterogeneo… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 11 Pages, Accepted at ICMI 2022 Oral

  21. BabyNet: A Lightweight Network for Infant Reaching Action Recognition in Unconstrained Environments to Support Future Pediatric Rehabilitation Applications

    Authors: Amel Dechemi, Vikarn Bhakri, Ipsita Sahin, Arjun Modi, Julya Mestas, Pamodya Peiris, Dannya Enriquez Barrundia, Elena Kokkoni, Konstantinos Karydis

    Abstract: Action recognition is an important component to improve autonomy of physical rehabilitation devices, such as wearable robotic exoskeletons. Existing human action recognition algorithms focus on adult applications rather than pediatric ones. In this paper, we introduce BabyNet, a light-weight (in terms of trainable parameters) network structure to recognize infant reaching action from off-body stat… ▽ More

    Submitted 12 October, 2022; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted to RO-MAN 2021

  22. arXiv:2206.10770  [pdf, ps, other

    cs.LG cs.AI stat.ML

    On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

    Authors: Jinglin Chen, Aditya Modi, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal

    Abstract: We study reward-free reinforcement learning (RL) under general non-linear function approximation, and establish sample efficiency and hardness results under various standard structural assumptions. On the positive side, we propose the RFOLIVE (Reward-Free OLIVE) algorithm for sample-efficient reward-free exploration under minimal structural assumptions, which covers the previously studied settings… ▽ More

    Submitted 22 October, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

  23. arXiv:2205.02455  [pdf, other

    cs.CL cs.AI cs.LG

    COGMEN: COntextualized GNN based Multimodal Emotion recognitioN

    Authors: Abhinav Joshi, Ashwani Bhat, Ayush Jain, Atin Vikram Singh, Ashutosh Modi

    Abstract: Emotions are an inherent part of human interactions, and consequently, it is imperative to develop AI systems that understand and recognize human emotions. During a conversation involving various people, a person's emotions are influenced by the other speaker's utterances and their own emotional state over the utterances. In this paper, we propose COntextualized Graph Neural Network based Multimod… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: 17 pages (9 main + 8 appendix). Accepted at NAACL 2022

  24. arXiv:2204.00806  [pdf, other

    cs.CL cs.AI cs.LG

    HLDC: Hindi Legal Documents Corpus

    Authors: Arnav Kapoor, Mudit Dhawan, Anmol Goel, T. H. Arjun, Akshala Bhatnagar, Vibhu Agrawal, Amul Agrawal, Arnab Bhattacharya, Ponnurangam Kumaraguru, Ashutosh Modi

    Abstract: Many populous countries including India are burdened with a considerable backlog of legal cases. Development of automated systems that could process legal documents and augment legal practitioners can mitigate this. However, there is a dearth of high-quality corpora that is needed to develop such data-driven systems. The problem gets even more pronounced in the case of low resource languages such… ▽ More

    Submitted 24 May, 2024; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: 16 Pages, Accepted at ACL 2022 Findings

  25. arXiv:2201.13125  [pdf, other

    cs.CL cs.AI cs.LG

    Corpus for Automatic Structuring of Legal Documents

    Authors: Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, Ashutosh Modi

    Abstract: In populous countries, pending legal cases have been growing exponentially. There is a need for developing techniques for processing and organizing legal documents. In this paper, we introduce a new corpus for structuring legal documents. In particular, we introduce a corpus of legal judgment documents in English that are segmented into topical and coherent parts. Each of these parts is annotated… ▽ More

    Submitted 19 September, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: Accepted at LREC 2022, 10 Pages (8 page main paper + 2 page references)

  26. arXiv:2201.01387  [pdf, other

    eess.SY cs.AI cs.LG stat.ME

    Joint Learning-Based Stabilization of Multiple Unknown Linear Systems

    Authors: Mohamad Kazem Shirani Faradonbeh, Aditya Modi

    Abstract: Learning-based control of linear systems received a lot of attentions recently. In popular settings, the true dynamical models are unknown to the decision-maker and need to be interactively learned by applying control inputs to the systems. Unlike the matured literature of efficient reinforcement learning policies for adaptive control of a single system, results on joint learning of multiple syste… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

  27. arXiv:2112.10955  [pdf, other

    stat.ML cs.LG eess.SY math.DS

    Joint Learning of Linear Time-Invariant Dynamical Systems

    Authors: Aditya Modi, Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

    Abstract: Linear time-invariant systems are very popular models in system theory and applications. A fundamental problem in system identification that remains rather unaddressed in extant literature is to leverage commonalities amongst related linear systems to estimate their transition matrices more accurately. To address this problem, the current paper investigates methods for jointly estimating the trans… ▽ More

    Submitted 2 January, 2024; v1 submitted 20 December, 2021; originally announced December 2021.

  28. arXiv:2112.01938  [pdf, other

    cs.CL cs.AI cs.LG

    Shapes of Emotions: Multimodal Emotion Recognition in Conversations via Emotion Shifts

    Authors: Harsh Agarwal, Keshav Bansal, Abhinav Joshi, Ashutosh Modi

    Abstract: Emotion Recognition in Conversations (ERC) is an important and active research area. Recent work has shown the benefits of using multiple modalities (e.g., text, audio, and video) for the ERC task. In a conversation, participants tend to maintain a particular emotional state unless some stimuli evokes a change. There is a continuous ebb and flow of emotions in a conversation. Inspired by this obse… ▽ More

    Submitted 7 November, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: 13 pages, Accepted at Workshop on Performance and Interpretability Evaluations of Multimodal, Multipurpose, Massive-Scale Models, COLING 2022

  29. arXiv:2112.01836  [pdf, other

    cs.CL cs.AI cs.LG

    Semantic Segmentation of Legal Documents via Rhetorical Roles

    Authors: Vijit Malik, Rishabh Sanjay, Shouvik Kumar Guha, Angshuman Hazarika, Shubham Nigam, Arnab Bhattacharya, Ashutosh Modi

    Abstract: Legal documents are unstructured, use legal jargon, and have considerable length, making them difficult to process automatically via conventional text processing techniques. A legal document processing system would benefit substantially if the documents could be segmented into coherent information units. This paper proposes a new corpus of legal documents annotated (with the help of legal experts)… ▽ More

    Submitted 7 November, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: 19 pages, Accepted at Natural Legal Language Processing Workshop, EMNLP 2022

  30. arXiv:2109.07763  [pdf, other

    eess.SP cs.IT

    Design and Evaluation of Reconfigurable Intelligent Surfaces in Real-World Environment

    Authors: Georgios C. Trichopoulos, Panagiotis Theofanopoulos, Bharath Kashyap, Aditya Shekhawat, Anuj Modi, Tawfik Osman, Sanjay Kumar, Anand Sengar, Arkajyoti Chang, Ahmed Alkhateeb

    Abstract: Reconfigurable intelligent surfaces (RISs) have promising coverage and data rate gains for wireless communication systems in 5G and beyond. Prior work has mainly focused on analyzing the performance of these surfaces using computer simulations or lab-level prototypes. To draw accurate insights about the actual performance of these systems, this paper develops an RIS proof-of-concept prototype and… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: Submitted to IEEE Open Journal of the Communications Society, 29 pages, 20 figures

  31. arXiv:2107.12135  [pdf, other

    cs.CL cs.AI

    Fine-Grained Emotion Prediction by Modeling Emotion Definitions

    Authors: Gargi Singh, Dhanajit Brahma, Piyush Rai, Ashutosh Modi

    Abstract: In this paper, we propose a new framework for fine-grained emotion prediction in the text through emotion definition modeling. Our approach involves a multi-task learning framework that models definitions of emotions as an auxiliary task while being trained on the primary task of emotion prediction. We model definitions using masked language modeling and class definition prediction tasks. Our mode… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

    Comments: 8 Pages, accepted at ACII 2021 for Orals

  32. arXiv:2107.08408  [pdf, other

    cs.CL cs.AI cs.MA cs.RO

    Pre-trained Language Models as Prior Knowledge for Playing Text-based Games

    Authors: Ishika Singh, Gargi Singh, Ashutosh Modi

    Abstract: Recently, text world games have been proposed to enable artificial agents to understand and reason about real-world scenarios. These text-based games are challenging for artificial agents, as it requires an understanding of and interaction using natural language in a partially observable environment. Agents observe the environment via textual descriptions designed to be challenging enough for even… ▽ More

    Submitted 23 December, 2021; v1 submitted 18 July, 2021; originally announced July 2021.

    Comments: 40 Pages (8 Pages main content + 1 Page references + 31 Pages Appendix). Some new results added

  33. arXiv:2107.05202  [pdf, other

    cs.CV

    Delta Sampling R-BERT for limited data and low-light action recognition

    Authors: Sanchit Hira, Ritwik Das, Abhinav Modi, Daniil Pakhomov

    Abstract: We present an approach to perform supervised action recognition in the dark. In this work, we present our results on the ARID dataset. Most previous works only evaluate performance on large, well illuminated datasets like Kinetics and HMDB51. We demonstrate that our work is able to achieve a very low error rate while being trained on a much smaller dataset of dark videos. We also explore a variety… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

  34. KEA: Tuning an Exabyte-Scale Data Infrastructure

    Authors: Yiwen Zhu, Subru Krishnan, Konstantinos Karanasos, Isha Tarte, Conor Power, Abhishek Modi, Manoj Kumar, Deli Zhang, Kartheek Muthyala, Nick Jurgens, Sarvesh Sakalanaga, Sudhir Darbha, Minu Iyer, Ankita Agarwal, Carlo Curino

    Abstract: Microsoft's internal big-data infrastructure is one of the largest in the world -- with over 300k machines running billions of tasks from over 0.6M daily jobs. Operating this infrastructure is a costly and complex endeavor, and efficiency is paramount. In fact, for over 15 years, a dedicated engineering team has tuned almost every aspect of this infrastructure, achieving state-of-the-art efficienc… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  35. arXiv:2105.13562  [pdf, other

    cs.CL cs.AI

    ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation

    Authors: Vijit Malik, Rishabh Sanjay, Shubham Kumar Nigam, Kripa Ghosh, Shouvik Kumar Guha, Arnab Bhattacharya, Ashutosh Modi

    Abstract: An automated system that could assist a judge in predicting the outcome of a case would help expedite the judicial process. For such a system to be practically useful, predictions by the system should be explainable. To promote research in developing such a system, we introduce ILDC (Indian Legal Documents Corpus). ILDC is a large corpus of 35k Indian Supreme Court cases annotated with original co… ▽ More

    Submitted 31 May, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted at ACL 2021, 17 Pages (9 Pages main paper, 4 pages references, 4 pages appendix)

  36. arXiv:2105.05621  [pdf, other

    cs.CL cs.AI

    NLP for Climate Policy: Creating a Knowledge Platform for Holistic and Effective Climate Action

    Authors: Pradip Swarnakar, Ashutosh Modi

    Abstract: Climate change is a burning issue of our time, with the Sustainable Development Goal (SDG) 13 of the United Nations demanding global climate action. Realizing the urgency, in 2015 in Paris, world leaders signed an agreement committing to taking voluntary action to reduce carbon emissions. However, the scale, magnitude, and climate action processes vary globally, especially between developed and de… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 12 Pages (8 + 4 pages for references)

  37. arXiv:2104.03071  [pdf, other

    cs.CL

    BreakingBERT@IITK at SemEval-2021 Task 9 : Statement Verification and Evidence Finding with Tables

    Authors: Aditya Jindal, Ankur Gupta, Jaya Srivastava, Preeti Menghwani, Vijit Malik, Vishesh Kaushik, Ashutosh Modi

    Abstract: Recently, there has been an interest in factual verification and prediction over structured data like tables and graphs. To circumvent any false news incident, it is necessary to not only model and predict over structured data efficiently but also to explain those predictions. In this paper, as part of the SemEval-2021 Task 9, we tackle the problem of fact verification and evidence finding over ta… ▽ More

    Submitted 10 April, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: Accepted at SemEval 2021 Task 9, 11 Pages (8 Pages main content+ 1 pages for references + 2 Pages Appendix)

  38. arXiv:2104.01619  [pdf, other

    cs.CL

    KnowGraph@IITK at SemEval-2021 Task 11: Building KnowledgeGraph for NLP Research

    Authors: Shashank Shailabh, Sajal Chaurasia, Ashutosh Modi

    Abstract: Research in Natural Language Processing is making rapid advances, resulting in the publication of a large number of research papers. Finding relevant research papers and their contribution to the domain is a challenging problem. In this paper, we address this challenge via the SemEval 2021 Task 11: NLPContributionGraph, by developing a system for a research paper contributions-focused knowledge gr… ▽ More

    Submitted 4 April, 2021; originally announced April 2021.

    Comments: Accepted at SemEval 2021 Task 11, 11 Pages (9 Pages main content+ 2 pages for references)

  39. arXiv:2104.01567  [pdf, other

    cs.CL

    MCL@IITK at SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation using Augmented Data, Signals, and Transformers

    Authors: Rohan Gupta, Jay Mundra, Deepak Mahajan, Ashutosh Modi

    Abstract: In this work, we present our approach for solving the SemEval 2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation (MCL-WiC). The task is a sentence pair classification problem where the goal is to detect whether a given word common to both the sentences evokes the same meaning. We submit systems for both the settings - Multilingual (the pair's sentences belong to the same la… ▽ More

    Submitted 4 April, 2021; originally announced April 2021.

    Comments: Accepted at SemEval 2021 Task 2, 10 Pages (8 Pages main content+ 2 pages for references)

  40. arXiv:2104.01566  [pdf, other

    cs.CL

    IITK@Detox at SemEval-2021 Task 5: Semi-Supervised Learning and Dice Loss for Toxic Spans Detection

    Authors: Archit Bansal, Abhay Kaushik, Ashutosh Modi

    Abstract: In this work, we present our approach and findings for SemEval-2021 Task 5 - Toxic Spans Detection. The task's main aim was to identify spans to which a given text's toxicity could be attributed. The task is challenging mainly due to two constraints: the small training dataset and imbalanced class distribution. Our paper investigates two techniques, semi-supervised learning and learning with Self-… ▽ More

    Submitted 4 April, 2021; originally announced April 2021.

    Comments: Accepted at SemEval 2021 Task 5, 9 Pages (6 Pages main content + 1 Page for references + 2 Pages Appendix)

  41. arXiv:2104.01563  [pdf, other

    cs.CL cs.AI cs.LG

    ReCAM@IITK at SemEval-2021 Task 4: BERT and ALBERT based Ensemble for Abstract Word Prediction

    Authors: Abhishek Mittal, Ashutosh Modi

    Abstract: This paper describes our system for Task 4 of SemEval-2021: Reading Comprehension of Abstract Meaning (ReCAM). We participated in all subtasks where the main goal was to predict an abstract word missing from a statement. We fine-tuned the pre-trained masked language models namely BERT and ALBERT and used an Ensemble of these as our submitted system on Subtask 1 (ReCAM-Imperceptibility) and Subtask… ▽ More

    Submitted 4 April, 2021; originally announced April 2021.

    Comments: Accepted at SemEval 2021 Task 4, 8 Pages (7 Pages main content + 1 pages for references)

  42. arXiv:2104.01364  [pdf, other

    cs.CL

    Counts@IITK at SemEval-2021 Task 8: SciBERT Based Entity And Semantic Relation Extraction For Scientific Data

    Authors: Akash Gangwar, Sabhay Jain, Shubham Sourav, Ashutosh Modi

    Abstract: This paper presents the system for SemEval 2021 Task 8 (MeasEval). MeasEval is a novel span extraction, classification, and relation extraction task focused on finding quantities, attributes of these quantities, and additional information, including the related measured entities, properties, and measurement contexts. Our submitted system, which placed fifth (team rank) on the leaderboard, consiste… ▽ More

    Submitted 3 April, 2021; originally announced April 2021.

    Comments: Accepted at SemEval 2021 Task 8, 7 Pages (5 Pages main content + 1 page for references + 1 Page Appendix)

  43. arXiv:2104.01046  [pdf, other

    cs.CL

    IITK@LCP at SemEval 2021 Task 1: Classification for Lexical Complexity Regression Task

    Authors: Neil Rajiv Shirude, Sagnik Mukherjee, Tushar Shandhilya, Ananta Mukherjee, Ashutosh Modi

    Abstract: This paper describes our contribution to SemEval 2021 Task 1: Lexical Complexity Prediction. In our approach, we leverage the ELECTRA model and attempt to mirror the data annotation scheme. Although the task is a regression task, we show that we can treat it as an aggregation of several classification and regression models. This somewhat counter-intuitive approach achieved an MAE score of 0.0654 f… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: Accepted at SemEval 2021 Task 1, 7 Pages (5 Pages main content+ 2 pages for reference)

  44. arXiv:2104.00933  [pdf, other

    cs.CL

    Humor@IITK at SemEval-2021 Task 7: Large Language Models for Quantifying Humor and Offensiveness

    Authors: Aishwarya Gupta, Avik Pal, Bholeshwar Khurana, Lakshay Tyagi, Ashutosh Modi

    Abstract: Humor and Offense are highly subjective due to multiple word senses, cultural knowledge, and pragmatic competence. Hence, accurately detecting humorous and offensive texts has several compelling use cases in Recommendation Systems and Personalized Content Moderation. However, due to the lack of an extensive labeled dataset, most prior works in this domain haven't explored large neural models for s… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: Accepted at SemEval 2021 Task 7, 7 Pages (6 Pages main content + 2 pages for references)

  45. arXiv:2103.01544  [pdf, other

    cs.CL cs.AI cs.LG

    An End-to-End Network for Emotion-Cause Pair Extraction

    Authors: Aaditya Singh, Shreeshail Hingane, Saim Wani, Ashutosh Modi

    Abstract: The task of Emotion-Cause Pair Extraction (ECPE) aims to extract all potential clause-pairs of emotions and their corresponding causes in a document. Unlike the more well-studied task of Emotion Cause Extraction (ECE), ECPE does not require the emotion clauses to be provided as annotations. Previous works on ECPE have either followed a multi-stage approach where emotion extraction, cause extractio… ▽ More

    Submitted 3 March, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted at WASSA-2021, 5 Pages + 2 Pages (references) + 2 Pages (Appendix)

  46. arXiv:2102.07035  [pdf, other

    cs.LG stat.ML

    Model-free Representation Learning and Exploration in Low-rank MDPs

    Authors: Aditya Modi, Jinglin Chen, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal

    Abstract: The low rank MDP has emerged as an important model for studying representation learning and exploration in reinforcement learning. With a known representation, several model-free exploration strategies exist. In contrast, all algorithms for the unknown representation setting are model-based, thereby requiring the ability to model the full dynamics. In this work, we present the first model-free rep… ▽ More

    Submitted 21 June, 2022; v1 submitted 13 February, 2021; originally announced February 2021.

    Comments: Changelog v2: Significant reorganization of the paper, added an improved analysis of elliptic planner and updated discussion wrt follow-up work

  47. arXiv:2101.08523  [pdf, other

    cs.CL cs.AI cs.LG

    Adv-OLM: Generating Textual Adversaries via OLM

    Authors: Vijit Malik, Ashwani Bhat, Ashutosh Modi

    Abstract: Deep learning models are susceptible to adversarial examples that have imperceptible perturbations in the original input, resulting in adversarial attacks against these models. Analysis of these attacks on the state of the art transformers in NLP can help improve the robustness of these models against such adversarial inputs. In this paper, we present Adv-OLM, a black-box attack method that adapts… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: 5 Pages + 1 Page references + 3 Pages Appendix, Accepted at EACL 2021

  48. arXiv:2011.04000  [pdf, other

    cs.CL cs.AI cs.LG

    Adapting a Language Model for Controlled Affective Text Generation

    Authors: Ishika Singh, Ahsan Barkati, Tushar Goswamy, Ashutosh Modi

    Abstract: Human use language not just to convey information but also to express their inner feelings and mental states. In this work, we adapt the state-of-the-art language generation models to generate affective (emotional) text. We posit a model capable of generating affect-driven and topic-focused sentences without losing grammatical correctness as the affect intensity increases. We propose to incorporat… ▽ More

    Submitted 8 November, 2020; originally announced November 2020.

    Comments: 15 Pages (9 + 2 (references) + 4 (appendix)), accepted at COLING 2020

  49. arXiv:2007.15619  [pdf, other

    cs.CY cs.CL cs.LG

    AI-based Monitoring and Response System for Hospital Preparedness towards COVID-19 in Southeast Asia

    Authors: Tushar Goswamy, Naishadh Parmar, Ayush Gupta, Raunak Shah, Vatsalya Tandon, Varun Goyal, Sanyog Gupta, Karishma Laud, Shivam Gupta, Sudhanshu Mishra, Ashutosh Modi

    Abstract: This research paper proposes a COVID-19 monitoring and response system to identify the surge in the volume of patients at hospitals and shortage of critical equipment like ventilators in South-east Asian countries, to understand the burden on health facilities. This can help authorities in these regions with resource planning measures to redirect resources to the regions identified by the model. D… ▽ More

    Submitted 5 September, 2022; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: 5 pages, 5 figures. Accepted to the ICML 2020 Workshop on Healthcare Systems, Population Health, and the Role of Health-Tech

  50. arXiv:2007.12678  [pdf, other

    cs.LG cs.AI stat.ML

    Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies

    Authors: Shengpu Tang, Aditya Modi, Michael W. Sjoding, Jenna Wiens

    Abstract: Standard reinforcement learning (RL) aims to find an optimal policy that identifies the best action for each state. However, in healthcare settings, many actions may be near-equivalent with respect to the reward (e.g., survival). We consider an alternative objective -- learning set-valued policies to capture near-equivalent actions that lead to similar cumulative rewards. We propose a model-free a… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: ICML 2020. Code available at https://github.com/shengpu1126/RL-Set-Valued-Policy