Skip to main content

Showing 1–19 of 19 results for author: Mondal, I

  1. arXiv:2406.16342  [pdf, other

    cs.CL

    ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks

    Authors: Yoo Yeon Sung, Eve Fleisig, Ishani Mondal, Jordan Lee Boyd-Graber

    Abstract: Adversarial benchmarks validate model abilities by providing samples that fool models but not humans. However, despite the proliferation of datasets that claim to be adversarial, there does not exist an established metric to evaluate how adversarial these datasets are. To address this lacuna, we introduce ADVSCORE, a metric which quantifies how adversarial and discriminative an adversarial dataset… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2401.11185

  2. arXiv:2404.05088  [pdf, other

    cs.CL

    How much reliable is ChatGPT's prediction on Information Extraction under Input Perturbations?

    Authors: Ishani Mondal, Abhilasha Sancheti

    Abstract: In this paper, we assess the robustness (reliability) of ChatGPT under input perturbations for one of the most fundamental tasks of Information Extraction (IE) i.e. Named Entity Recognition (NER). Despite the hype, the majority of the researchers have vouched for its language understanding and generation capabilities; a little attention has been paid to understand its robustness: How the input-per… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 3 Figures, 7 Tables

  3. arXiv:2402.11161  [pdf, other

    cs.CL cs.AI

    PEDANTS (Precise Evaluations of Diverse Answer Nominee Text for Skinflints): Efficient Evaluation Analysis and Benchmarking for Open-Domain Question Answering

    Authors: Zongxia Li, Ishani Mondal, Yijun Liang, Huy Nghiem, Jordan Lee Boyd-Graber

    Abstract: Question answering (QA) can only make progress if we know if an answer is correct, but for many of the most challenging and interesting QA examples, current efficient answer correctness (AC) metrics do not align with human judgments, particularly verbose, free-form answers from large language models (LLMs). There are two challenges: a lack of diverse evaluation data and that models are too big and… ▽ More

    Submitted 6 July, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Efficient PEDANTS Classifier for short-form QA in github: https://github.com/zli12321/qa_metrics. arXiv admin note: text overlap with arXiv:2401.13170

  4. arXiv:2401.15579  [pdf, other

    cs.CL cs.SD eess.AS

    MunTTS: A Text-to-Speech System for Mundari

    Authors: Varun Gumma, Rishav Hada, Aditya Yadavalli, Pamir Gogoi, Ishani Mondal, Vivek Seshadri, Kalika Bali

    Abstract: We present MunTTS, an end-to-end text-to-speech (TTS) system specifically for Mundari, a low-resource Indian language of the Austo-Asiatic family. Our work addresses the gap in linguistic technology for underrepresented languages by collecting and processing data to build a speech synthesis system. We begin our study by gathering a substantial dataset of Mundari text and speech and train end-to-en… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted to ComputEL-7

  5. arXiv:2401.13170   

    cs.CL

    CFMatch: Aligning Automated Answer Equivalence Evaluation with Expert Judgments For Open-Domain Question Answering

    Authors: Zongxia Li, Ishani Mondal, Yijun Liang, Huy Nghiem, Jordan Boyd-Graber

    Abstract: Question answering (QA) can only make progress if we know if an answer is correct, but for many of the most challenging and interesting QA examples, current evaluation metrics to determine answer equivalence (AE) often do not align with human judgments, particularly more verbose, free-form answers from large language models (LLM). There are two challenges: a lack of data and that models are too bi… ▽ More

    Submitted 29 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: A duplicate and polished version is in arXiv:2402.11161

  6. arXiv:2401.11185  [pdf, other

    cs.CL cs.HC

    How the Advent of Ubiquitous Large Language Models both Stymie and Turbocharge Dynamic Adversarial Question Generation

    Authors: Yoo Yeon Sung, Ishani Mondal, Jordan Boyd-Graber

    Abstract: Dynamic adversarial question generation, where humans write examples to stump a model, aims to create examples that are realistic and informative. However, the advent of large language models (LLMs) has been a double-edged sword for human authors: more people are interested in seeing and pushing the limits of these models, but because the models are so much stronger an opponent, they are harder to… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  7. arXiv:2305.14659  [pdf, other

    cs.CL

    InteractiveIE: Towards Assessing the Strength of Human-AI Collaboration in Improving the Performance of Information Extraction

    Authors: Ishani Mondal, Michelle Yuan, Anandhavelu N, Aparna Garimella, Francis Ferraro, Andrew Blair-Stanek, Benjamin Van Durme, Jordan Boyd-Graber

    Abstract: Learning template based information extraction from documents is a crucial yet difficult task. Prior template-based IE approaches assume foreknowledge of the domain templates; however, real-world IE do not have pre-defined schemas and it is a figure-out-as you go phenomena. To quickly bootstrap templates in a real-world setting, we need to induce template slots from documents with zero or minimal… ▽ More

    Submitted 17 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Version 2

  8. arXiv:2302.09685  [pdf, other

    cs.IR cs.CL

    Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages

    Authors: Ankan Mullick, Ishani Mondal, Sourjyadip Ray, R Raghav, G Sai Chaitanya, Pawan Goyal

    Abstract: Scarcity of data and technological limitations for resource-poor languages in developing countries like India poses a threat to the development of sophisticated NLU systems for healthcare. To assess the current status of various state-of-the-art language models in healthcare, this paper studies the problem by initially proposing two different Healthcare datasets, Indian Healthcare Query Intent-Web… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

    Journal ref: EACL 2023 Findings Full Paper

  9. arXiv:2211.11049  [pdf, other

    cs.CL cs.AI

    Explaining (Sarcastic) Utterances to Enhance Affect Understanding in Multimodal Dialogues

    Authors: Shivani Kumar, Ishani Mondal, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: Conversations emerge as the primary media for exchanging ideas and conceptions. From the listener's perspective, identifying various affective qualities, such as sarcasm, humour, and emotions, is paramount for comprehending the true connotation of the emitted utterance. However, one of the major hurdles faced in learning these affect dimensions is the presence of figurative language, viz. irony, m… ▽ More

    Submitted 22 November, 2022; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted at AAAI 2023. 11 Pages; 14 Tables; 3 Figures

  10. arXiv:2204.07705  [pdf, other

    cs.CL cs.AI

    Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

    Authors: Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza , et al. (15 additional authors not shown)

    Abstract: How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting,… ▽ More

    Submitted 24 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted to EMNLP 2022, 25 pages

  11. arXiv:2204.02790  [pdf, other

    cs.CY cs.CL

    Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic?

    Authors: Ishani Mondal, Kabir Ahuja, Mohit Jain, Jacki O Neil, Kalika Bali, Monojit Choudhury

    Abstract: The COVID-19 pandemic has brought out both the best and worst of language technology (LT). On one hand, conversational agents for information dissemination and basic diagnosis have seen widespread use, and arguably, had an important role in combating the pandemic. On the other hand, it has also become clear that such technologies are readily available for a handful of languages, and the vast major… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: Under Revision

  12. arXiv:2110.01951  [pdf, other

    cs.LG cs.CL

    Multi-Objective Few-shot Learning for Fair Classification

    Authors: Ishani Mondal, Procheta Sen, Debasis Ganguly

    Abstract: In this paper, we propose a general framework for mitigating the disparities of the predicted classes with respect to secondary attributes within the data (e.g., race, gender etc.). Our proposed method involves learning a multi-objective function that in addition to learning the primary objective of predicting the primary class labels from the data, also employs a clustering-based heuristic to min… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted as a short paper in CIKM 2021

  13. arXiv:2108.07793   

    cs.CY

    Modeling Pedagogical Learning Environment with Hybrid Model based on ICT

    Authors: Al Maruf Hassan, Istiak Ahmed Mondal

    Abstract: Pedagogy is a method that handles the ethos and culture of instruction from educators and the learning of learners. Pedagogy of Information and Communications Technology (ICT) refers to the interactions among the teacher, children, and learning environment based on ICT. It is a discipline that deals with the theory and practice of teaching strategies, teaching actions, teaching judgments, and deci… ▽ More

    Submitted 27 August, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: Problem has been solved related to the problem has been solved related to a big error in the basic concept

  14. arXiv:2106.01167  [pdf, other

    cs.CL

    End-to-End NLP Knowledge Graph Construction

    Authors: Ishani Mondal, Yufang Hou, Charles Jochim

    Abstract: This paper studies the end-to-end construction of an NLP Knowledge Graph (KG) from scientific papers. We focus on extracting four types of relations: evaluatedOn between tasks and datasets, evaluatedBy between tasks and evaluation metrics, as well as coreferent and related relations between the same type of entities. For instance, F1-score is coreferent with F-measure. We introduce novel methods f… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted in ACL 2021

  15. arXiv:2104.01782  [pdf, other

    cs.CL

    BBAEG: Towards BERT-based Biomedical Adversarial Example Generation for Text Classification

    Authors: Ishani Mondal

    Abstract: Healthcare predictive analytics aids medical decision-making, diagnosis prediction and drug review analysis. Therefore, prediction accuracy is an important criteria which also necessitates robust predictive language models. However, the models using deep learning have been proven vulnerable towards insignificantly perturbed input instances which are less likely to be misclassified by humans. Recen… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: To appear in NAACL 2021

  16. arXiv:2012.11599  [pdf, other

    cs.CL cs.AI

    BERTChem-DDI : Improved Drug-Drug Interaction Prediction from text using Chemical Structure Information

    Authors: Ishani Mondal

    Abstract: Traditional biomedical version of embeddings obtained from pre-trained language models have recently shown state-of-the-art results for relation extraction (RE) tasks in the medical domain. In this paper, we explore how to incorporate domain knowledge, available in the form of molecular structure of drugs, for predicting Drug-Drug Interaction from textual corpus. We propose a method, BERTChem-DDI,… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.11142

  17. Medical Entity Linking using Triplet Network

    Authors: Ishani Mondal, Sukannya Purkayastha, Sudeshna Sarkar, Pawan Goyal, Jitesh Pillai, Amitava Bhattacharyya, Mahanandeeshwar Gattu

    Abstract: Entity linking (or Normalization) is an essential task in text mining that maps the entity mentions in the medical text to standard entities in a given Knowledge Base (KB). This task is of great importance in the medical domain. It can also be used for merging different medical and clinical ontologies. In this paper, we center around the problem of disease linking or normalization. This task is ex… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: ClinicalNLP@NAACL 2019

  18. arXiv:2012.11142  [pdf, ps, other

    cs.CL cs.AI

    Towards Incorporating Entity-specific Knowledge Graph Information in Predicting Drug-Drug Interactions

    Authors: Ishani Mondal

    Abstract: Off-the-shelf biomedical embeddings obtained from the recently released various pre-trained language models (such as BERT, XLNET) have demonstrated state-of-the-art results (in terms of accuracy) for the various natural language understanding tasks (NLU) in the biomedical domain. Relation Classification (RC) falls into one of the most critical tasks. In this paper, we explore how to incorporate do… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

  19. arXiv:2009.00859  [pdf, other

    cs.CV cs.AI

    ALEX: Active Learning based Enhancement of a Model's Explainability

    Authors: Ishani Mondal, Debasis Ganguly

    Abstract: An active learning (AL) algorithm seeks to construct an effective classifier with a minimal number of labeled examples in a bootstrapping manner. While standard AL heuristics, such as selecting those points for annotation for which a classification model yields least confident predictions, there has been no empirical investigation to see if these heuristics lead to models that are more interpretab… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

    Comments: CIKM 2020