Skip to main content

Showing 1–45 of 45 results for author: Li, I

  1. arXiv:2407.10794  [pdf, other

    cs.CL cs.AI

    Graphusion: Leveraging Large Language Models for Scientific Knowledge Graph Fusion and Construction in NLP Education

    Authors: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, Irene Li

    Abstract: Knowledge graphs (KGs) are crucial in the field of artificial intelligence and are widely applied in downstream tasks, such as enhancing Question Answering (QA) systems. The construction of KGs typically requires significant effort from domain experts. Recently, Large Language Models (LLMs) have been used for knowledge graph construction (KGC), however, most existing approaches focus on a local pe… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 24 pages, 11 figures, 13 tables. arXiv admin note: substantial text overlap with arXiv:2402.14293

  2. arXiv:2406.08837  [pdf

    eess.IV cs.CV cs.LG

    Research on Deep Learning Model of Feature Extraction Based on Convolutional Neural Network

    Authors: Houze Liu, Iris Li, Yaxin Liang, Dan Sun, Yining Yang, Haowei Yang

    Abstract: Neural networks with relatively shallow layers and simple structures may have limited ability in accurately identifying pneumonia. In addition, deep neural networks also have a large demand for computing resources, which may cause convolutional neural networks to be unable to be implemented on terminals. Therefore, this paper will carry out the optimal classification of convolutional neural networ… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2403.05881  [pdf, other

    cs.CL

    KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques

    Authors: Rui Yang, Haoran Liu, Edison Marrese-Taylor, Qingcheng Zeng, Yu He Ke, Wanxin Li, Lechao Cheng, Qingyu Chen, James Caverlee, Yutaka Matsuo, Irene Li

    Abstract: Large language models (LLMs) have demonstrated impressive generative capabilities with the potential to innovate in medicine. However, the application of LLMs in real clinical settings remains challenging due to the lack of factual consistency in the generated content. In this work, we develop an augmented LLM framework, KG-Rank, which leverages a medical knowledge graph (KG) along with ranking an… ▽ More

    Submitted 4 July, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures, 8 tables

  4. arXiv:2402.17019  [pdf, other

    cs.CL cs.HC

    Leveraging Large Language Models for Learning Complex Legal Concepts through Storytelling

    Authors: Hang Jiang, Xiajie Zhang, Robert Mahari, Daniel Kessler, Eric Ma, Tal August, Irene Li, Alex 'Sandy' Pentland, Yoon Kim, Deb Roy, Jad Kabbara

    Abstract: Making legal knowledge accessible to non-experts is crucial for enhancing general legal literacy and encouraging civic participation in democracy. However, legal documents are often challenging to understand for people without legal backgrounds. In this paper, we present a novel application of large language models (LLMs) in legal education to help non-experts learn intricate legal concepts throug… ▽ More

    Submitted 2 July, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024

  5. arXiv:2402.14293  [pdf, other

    cs.CL

    Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education

    Authors: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, Irene Li

    Abstract: In the domain of Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated promise in text-generation tasks. However, their educational applications, particularly for domain-specific queries, remain underexplored. This study investigates LLMs' capabilities in educational scenarios, focusing on concept graph recovery and question-answering (QA). We assess LLMs' zero-shot per… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  6. arXiv:2401.09972  [pdf, other

    cs.CL

    Better Explain Transformers by Illuminating Important Information

    Authors: Linxin Song, Yan Cui, Ao Luo, Freddy Lecue, Irene Li

    Abstract: Transformer-based models excel in various natural language processing (NLP) tasks, attracting countless efforts to explain their inner workings. Prior methods explain Transformers by focusing on the raw gradient and attention as token attribution scores, where non-relevant information is often considered during explanation computation, resulting in confusing results. In this work, we propose highl… ▽ More

    Submitted 26 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  7. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  8. arXiv:2312.10463  [pdf, other

    cs.IR

    RecPrompt: A Prompt Tuning Framework for News Recommendation Using Large Language Models

    Authors: Dairui Liu, Boming Yang, Honghui Du, Derek Greene, Aonghus Lawlor, Ruihai Dong, Irene Li

    Abstract: In the evolving field of personalized news recommendation, understanding the semantics of the underlying data is crucial. Large Language Models (LLMs) like GPT-4 have shown promising performance in understanding natural language. However, the extent of their applicability in news recommendation systems remains to be validated. This paper introduces RecPrompt, the first framework for news recommend… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: 8 pages, 3 figures, and 8 tables

  9. arXiv:2311.16588  [pdf

    cs.CL

    Ascle: A Python Natural Language Processing Toolkit for Medical Text Generation

    Authors: Rui Yang, Qingcheng Zeng, Keen You, Yujie Qiao, Lucas Huang, Chia-Chun Hsieh, Benjamin Rosand, Jeremy Goldwasser, Amisha D Dave, Tiarnan D. L. Keenan, Emily Y Chew, Dragomir Radev, Zhiyong Lu, Hua Xu, Qingyu Chen, Irene Li

    Abstract: This study introduces Ascle, a pioneering natural language processing (NLP) toolkit designed for medical text generation. Ascle is tailored for biomedical researchers and healthcare professionals with an easy-to-use, all-in-one solution that requires minimal programming expertise. For the first time, Ascle evaluates and provides interfaces for the latest pre-trained language models, encompassing f… ▽ More

    Submitted 9 December, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 5 figures, 4 tables

  10. arXiv:2311.04929  [pdf, other

    cs.CL cs.AI cs.DL cs.LG

    An Interdisciplinary Outlook on Large Language Models for Scientific Research

    Authors: James Boyko, Joseph Cohen, Nathan Fox, Maria Han Veiga, Jennifer I-Hsiu Li, Jing Liu, Bernardo Modenesi, Andreas H. Rauch, Kenneth N. Reid, Soumi Tribedi, Anastasia Visheratina, Xin Xie

    Abstract: In this paper, we describe the capabilities and constraints of Large Language Models (LLMs) within disparate academic disciplines, aiming to delineate their strengths and limitations with precision. We examine how LLMs augment scientific inquiry, offering concrete examples such as accelerating literature review by summarizing vast numbers of publications, enhancing code development through automat… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  11. arXiv:2310.02778  [pdf, other

    cs.CL cs.AI

    Integrating UMLS Knowledge into Large Language Models for Medical Question Answering

    Authors: Rui Yang, Edison Marrese-Taylor, Yuhe Ke, Lechao Cheng, Qingyu Chen, Irene Li

    Abstract: Large language models (LLMs) have demonstrated powerful text generation capabilities, bringing unprecedented innovation to the healthcare field. While LLMs hold immense promise for applications in healthcare, applying them to real clinical scenarios presents significant challenges, as these models may generate content that deviates from established medical facts and even exhibit potential biases.… ▽ More

    Submitted 13 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 12 pages, 3 figures

  12. arXiv:2309.15630  [pdf, other

    cs.CL

    NLPBench: Evaluating Large Language Models on Solving NLP Problems

    Authors: Linxin Song, Jieyu Zhang, Lechao Cheng, Pengyuan Zhou, Tianyi Zhou, Irene Li

    Abstract: Recent developments in large language models (LLMs) have shown promise in enhancing the capabilities of natural language processing (NLP). Despite these successes, there remains a dearth of research dedicated to the NLP problem-solving abilities of LLMs. To fill the gap in this area, we present a unique benchmarking dataset, NLPBench, comprising 378 college-level NLP questions spanning various NLP… ▽ More

    Submitted 19 October, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

  13. arXiv:2308.10410  [pdf, other

    cs.CL

    Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts

    Authors: Fan Gao, Hang Jiang, Rui Yang, Qingcheng Zeng, Jinghui Lu, Moritz Blum, Dairui Liu, Tianwei She, Yuang Jiang, Irene Li

    Abstract: Educational materials such as survey articles in specialized fields like computer science traditionally require tremendous expert inputs and are therefore expensive to create and update. Recently, Large Language Models (LLMs) have achieved significant success across various general tasks. However, their effectiveness and limitations in the education domain are yet to be fully explored. In this wor… ▽ More

    Submitted 23 May, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

    Journal ref: ACL 2024 Findings

  14. Going Beyond Local: Global Graph-Enhanced Personalized News Recommendations

    Authors: Boming Yang, Dairui Liu, Toyotaro Suzumura, Ruihai Dong, Irene Li

    Abstract: Precisely recommending candidate news articles to users has always been a core challenge for personalized news recommendation systems. Most recent works primarily focus on using advanced natural language processing techniques to extract semantic information from rich textual data, employing content-based methods derived from local historical news. However, this approach lacks a global perspective,… ▽ More

    Submitted 26 September, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: Recsys 2023, Best Student Paper

  15. arXiv:2306.07506  [pdf, other

    cs.IR

    Topic-Centric Explanations for News Recommendation

    Authors: Dairui Liu, Derek Greene, Irene Li, Xuefei Jiang, Ruihai Dong

    Abstract: News recommender systems (NRS) have been widely applied for online news websites to help users find relevant articles based on their interests. Recent methods have demonstrated considerable success in terms of recommendation performance. However, the lack of explanation for these recommendations can lead to mistrust among users and lack of acceptance of recommendations. To address this issue, we p… ▽ More

    Submitted 6 October, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: 20 pages

  16. arXiv:2305.03319  [pdf, other

    cs.CL

    HiPool: Modeling Long Documents Using Graph Neural Networks

    Authors: Irene Li, Aosong Feng, Dragomir Radev, Rex Ying

    Abstract: Encoding long sequences in Natural Language Processing (NLP) is a challenging problem. Though recent pretraining language models achieve satisfying performances in many NLP tasks, they are still restricted by a pre-defined maximum length, making them challenging to be extended to longer sequences. So some recent works utilize hierarchies to model long sequences. However, most of them apply sequent… ▽ More

    Submitted 14 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Journal ref: ACL 2023 main proceedings

  17. arXiv:2302.09665  [pdf, other

    cs.AI

    CitySpec with Shield: A Secure Intelligent Assistant for Requirement Formalization

    Authors: Zirong Chen, Issa Li, Haoxiang Zhang, Sarah Preum, John A. Stankovic, Meiyi Ma

    Abstract: An increasing number of monitoring systems have been developed in smart cities to ensure that the real-time operations of a city satisfy safety and performance requirements. However, many existing city requirements are written in English with missing, inaccurate, or ambiguous information. There is a high demand for assisting city policymakers in converting human-specified requirements to machine-u… ▽ More

    Submitted 30 March, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2206.03132

  18. arXiv:2302.06132  [pdf, other

    cs.CL

    NNKGC: Improving Knowledge Graph Completion with Node Neighborhoods

    Authors: Irene Li, Boming Yang

    Abstract: Knowledge graph completion (KGC) aims to discover missing relations of query entities. Current text-based models utilize the entity name and description to infer the tail entity given the head entity and a certain relation. Existing approaches also consider the neighborhood of the head entity. However, these methods tend to model the neighborhood using a flat structure and are only restricted to 1… ▽ More

    Submitted 19 October, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: DL4KG Workshop, ISWC 2023

  19. arXiv:2210.11794  [pdf, other

    cs.LG cs.CL

    Diffuser: Efficient Transformers with Multi-hop Attention Diffusion for Long Sequences

    Authors: Aosong Feng, Irene Li, Yuang Jiang, Rex Ying

    Abstract: Efficient Transformers have been developed for long sequence modeling, due to their subquadratic memory and time complexity. Sparse Transformer is a popular approach to improving the efficiency of Transformers by restricting self-attention to locations specified by the predefined sparse patterns. However, leveraging sparsity may sacrifice expressiveness compared to full-attention, when important t… ▽ More

    Submitted 31 January, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

  20. arXiv:2206.07152  [pdf, other

    cs.AI cs.FL cs.LG

    An Intelligent Assistant for Converting City Requirements to Formal Specification

    Authors: Zirong Chen, Isaac Li, Haoxiang Zhang, Sarah Preum, John Stankovic, Meiyi Ma

    Abstract: As more and more monitoring systems have been deployed to smart cities, there comes a higher demand for converting new human-specified requirements to machine-understandable formal specifications automatically. However, these human-specific requirements are often written in English and bring missing, inaccurate, or ambiguous information. In this paper, we present CitySpec, an intelligent assistant… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: This demo paper is accepted by SMARTCOMP 2022

  21. arXiv:2206.03132  [pdf, other

    cs.AI cs.CL cs.LG cs.SE

    CitySpec: An Intelligent Assistant System for Requirement Specification in Smart Cities

    Authors: Zirong Chen, Isaac Li, Haoxiang Zhang, Sarah Preum, John A. Stankovic, Meiyi Ma

    Abstract: An increasing number of monitoring systems have been developed in smart cities to ensure that real-time operations of a city satisfy safety and performance requirements. However, many existing city requirements are written in English with missing, inaccurate, or ambiguous information. There is a high demand for assisting city policy makers in converting human-specified requirements to machine-unde… ▽ More

    Submitted 14 June, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: This paper is accepted by SMARTCOMP 2022

  22. arXiv:2204.06604  [pdf, other

    cs.CL

    EHRKit: A Python Natural Language Processing Toolkit for Electronic Health Record Texts

    Authors: Irene Li, Keen You, Yujie Qiao, Lucas Huang, Chia-Chun Hsieh, Benjamin Rosand, Jeremy Goldwasser, Dragomir Radev

    Abstract: The Electronic Health Record (EHR) is an essential part of the modern medical system and impacts healthcare delivery, operations, and research. Unstructured text is attracting much attention despite structured information in the EHRs and has become an exciting research field. The success of the recent neural Natural Language Processing (NLP) method has led to a new direction for processing unstruc… ▽ More

    Submitted 27 June, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

  23. arXiv:2201.02312  [pdf, other

    cs.CL cs.AI

    A Transfer Learning Pipeline for Educational Resource Discovery with Application in Leading Paragraph Generation

    Authors: Irene Li, Thomas George, Alexander Fabbri, Tammy Liao, Benjamin Chen, Rina Kawamura, Richard Zhou, Vanessa Yan, Swapnil Hingmire, Dragomir Radev

    Abstract: Effective human learning depends on a wide selection of educational materials that align with the learner's current understanding of the topic. While the Internet has revolutionized human learning or education, a substantial resource accessibility barrier still exists. Namely, the excess of online information can make it challenging to navigate and discover high-quality learning materials. In this… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

  24. arXiv:2112.08578  [pdf, other

    cs.CL

    CLICKER: A Computational LInguistics Classification Scheme for Educational Resources

    Authors: Swapnil Hingmire, Irene Li, Rena Kawamura, Benjamin Chen, Alexander Fabbri, Xiangru Tang, Yixin Liu, Thomas George, Tammy Liao, Wai Pan Wong, Vanessa Yan, Richard Zhou, Girish K. Palshikar, Dragomir Radev

    Abstract: A classification scheme of a scientific subject gives an overview of its body of knowledge. It can also be used to facilitate access to research articles and other materials related to the subject. For example, the ACM Computing Classification System (CCS) is used in the ACM Digital Library search interface and also for indexing computer science papers. We observed that a comprehensive classificat… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 7 pages, 5 figures, 4 tables

  25. arXiv:2112.06377  [pdf, other

    cs.CL cs.LG

    Surfer100: Generating Surveys From Web Resources, Wikipedia-style

    Authors: Irene Li, Alexander Fabbri, Rina Kawamura, Yixin Liu, Xiangru Tang, Jaesung Tae, Chang Shen, Sally Ma, Tomoe Mizutani, Dragomir Radev

    Abstract: Fast-developing fields such as Artificial Intelligence (AI) often outpace the efforts of encyclopedic sources such as Wikipedia, which either do not completely cover recently-introduced topics or lack such content entirely. As a result, methods for automatically producing content are valuable tools to address this information overload. We show that recent advances in pretrained language modeling c… ▽ More

    Submitted 22 June, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

    Comments: LREC 2022, main conference

  26. arXiv:2109.08722  [pdf, other

    cs.LG cs.CL

    Efficient Variational Graph Autoencoders for Unsupervised Cross-domain Prerequisite Chains

    Authors: Irene Li, Vanessa Yan, Dragomir Radev

    Abstract: Prerequisite chain learning helps people acquire new knowledge efficiently. While people may quickly determine learning paths over concepts in a domain, finding such paths in other domains can be challenging. We introduce Domain-Adversarial Variational Graph Autoencoders (DAVGAE) to solve this cross-domain prerequisite chain learning task efficiently. Our novel model consists of a variational grap… ▽ More

    Submitted 30 October, 2021; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted by the Efficient Natural Language and Speech Processing (ENLSP) Workshop, NeurIPS 2021

  27. arXiv:2107.07657  [pdf, other

    cs.DS

    Streaming and Distributed Algorithms for Robust Column Subset Selection

    Authors: Shuli Jiang, Dongyu Li, Irene Mengze Li, Arvind V. Mahankali, David P. Woodruff

    Abstract: We give the first single-pass streaming algorithm for Column Subset Selection with respect to the entrywise $\ell_p$-norm with $1 \leq p < 2$. We study the $\ell_p$ norm loss since it is often considered more robust to noise than the standard Frobenius norm. Given an input matrix $A \in \mathbb{R}^{d \times n}$ ($n \gg d$), our algorithm achieves a multiplicative… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: Proceedings of the 38th International Conference on Machine Learning (ICML 2021)

  28. arXiv:2107.03891  [pdf, other

    cs.CV

    Technical Report for Valence-Arousal Estimation in ABAW2 Challenge

    Authors: Hong-Xia Xie, I-Hsuan Li, Ling Lo, Hong-Han Shuai, Wen-Huang Cheng

    Abstract: In this work, we describe our method for tackling the valence-arousal estimation challenge from ABAW2 ICCV-2021 Competition. The competition organizers provide an in-the-wild Aff-Wild2 dataset for participants to analyze affective behavior in real-life settings. We use a two stream model to learn emotion features from appearance and action respectively. To solve data imbalanced problem, we apply l… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2105.01502

  29. arXiv:2107.02975  [pdf, other

    cs.CL cs.AI

    Neural Natural Language Processing for Unstructured Data in Electronic Health Records: a Review

    Authors: Irene Li, Jessica Pan, Jeremy Goldwasser, Neha Verma, Wai Pan Wong, Muhammed Yavuz Nuzumlalı, Benjamin Rosand, Yixin Li, Matthew Zhang, David Chang, R. Andrew Taylor, Harlan M. Krumholz, Dragomir Radev

    Abstract: Electronic health records (EHRs), digital collections of patient healthcare events and observations, are ubiquitous in medicine and critical to healthcare delivery, operations, and research. Despite this central role, EHRs are notoriously difficult to process automatically. Well over half of the information stored within EHRs is in the form of unstructured text (e.g. provider notes, operation repo… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: 33 pages, 11 figures

    MSC Class: 68T50 ACM Class: I.2.7

  30. arXiv:2105.03505  [pdf, other

    cs.CL

    Unsupervised Cross-Domain Prerequisite Chain Learning using Variational Graph Autoencoders

    Authors: Irene Li, Vanessa Yan, Tianxiao Li, Rihao Qu, Dragomir Radev

    Abstract: Learning prerequisite chains is an essential task for efficiently acquiring knowledge in both known and unknown domains. For example, one may be an expert in the natural language processing (NLP) domain but want to determine the best order to learn new concepts in an unfamiliar Computer Vision domain (CV). Both domains share some common concepts, such as machine learning basics and deep learning m… ▽ More

    Submitted 27 May, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted by ACL 2021

  31. arXiv:2105.01502  [pdf, other

    cs.CV

    Technical Report for Valence-Arousal Estimation on Affwild2 Dataset

    Authors: I-Hsuan Li

    Abstract: In this work, we describe our method for tackling the valence-arousal estimation challenge from ABAW FG-2020 Competition. The competition organizers provide an in-the-wild Aff-Wild2 dataset for participants to analyze affective behavior in real-life settings. We use MIMAMO Net \cite{deng2020mimamo} model to achieve information about micro-motion and macro-motion for improving video emotion recogni… ▽ More

    Submitted 13 May, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

  32. arXiv:2103.14620  [pdf, other

    cs.CL

    LiGCN: Label-interpretable Graph Convolutional Networks for Multi-label Text Classification

    Authors: Irene Li, Aosong Feng, Hao Wu, Tianxiao Li, Toyotaro Suzumura, Ruihai Dong

    Abstract: Multi-label text classification (MLTC) is an attractive and challenging task in natural language processing (NLP). Compared with single-label text classification, MLTC has a wider range of applications in practice. In this paper, we propose a label-interpretable graph convolutional network model to solve the MLTC problem by modeling tokens and labels as nodes in a heterogeneous graph. In this way,… ▽ More

    Submitted 22 May, 2022; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: 8 tables, 3 figures

    Journal ref: DLG4NLP Workshop, NAACL 2022

  33. arXiv:2102.02114  [pdf, other

    cs.CL

    Detecting Bias in Transfer Learning Approaches for Text Classification

    Authors: Irene Li

    Abstract: Classification is an essential and fundamental task in machine learning, playing a cardinal role in the field of natural language processing (NLP) and computer vision (CV). In a supervised learning setting, labels are always needed for the classification task. Especially for deep neural models, a large amount of high-quality labeled data are required for training. However, when a new domain comes… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 3 figures

  34. arXiv:2007.08100  [pdf, other

    cs.CL cs.LG

    Towards Debiasing Sentence Representations

    Authors: Paul Pu Liang, Irene Mengze Li, Emily Zheng, Yao Chong Lim, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: As natural language processing methods are increasingly deployed in real-world scenarios such as healthcare, legal systems, and social science, it becomes necessary to recognize the role they potentially play in shaping social biases and stereotypes. Previous work has revealed the presence of social biases in widely used word embeddings involving gender, race, religion, and other social constructs… ▽ More

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: ACL 2020, code available at https://github.com/pliang279/sent_debias

  35. arXiv:2006.05573  [pdf, other

    cs.SI cs.LG physics.soc-ph

    Global Data Science Project for COVID-19

    Authors: Toyotaro Suzumura, Dario Garcia-Gasulla, Sergio Alvarez Napagao, Irene Li, Hiroshi Maruyama, Hiroki Kanezashi, Raquel P'erez-Arnal, Kunihiko Miyoshi, Euma Ishii, Keita Suzuki, Sayaka Shiba, Mariko Kurokawa, Yuta Kanzawa, Naomi Nakagawa, Masatoshi Hanai, Yixin Li, Tianxiao Li

    Abstract: This paper aims at providing the summary of the Global Data Science Project (GDSC) for COVID-19. as on May 31 2020. COVID-19 has largely impacted on our societies through both direct and indirect effects transmitted by the policy measures to counter the spread of viruses. We quantitatively analysed the multifaceted impacts of the COVID-19 pandemic on our societies including people's mobility, heal… ▽ More

    Submitted 3 August, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: 42 pages, 49 figures

  36. arXiv:2004.10899  [pdf, other

    cs.CL cs.CY cs.LG

    What are We Depressed about When We Talk about COVID19: Mental Health Analysis on Tweets Using Natural Language Processing

    Authors: Irene Li, Yixin Li, Tianxiao Li, Sergio Alvarez-Napagao, Dario Garcia-Gasulla, Toyotaro Suzumura

    Abstract: The outbreak of coronavirus disease 2019 (COVID-19) recently has affected human life to a great extent. Besides direct physical and economic threats, the pandemic also indirectly impact people's mental health conditions, which can be overwhelming but difficult to measure. The problem may come from various reasons such as unemployment status, stay-at-home policy, fear for the virus, and so forth. I… ▽ More

    Submitted 8 June, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: 7 pages, 7 figures

  37. arXiv:2004.10610  [pdf, other

    cs.CL

    R-VGAE: Relational-variational Graph Autoencoder for Unsupervised Prerequisite Chain Learning

    Authors: Irene Li, Alexander Fabbri, Swapnil Hingmire, Dragomir Radev

    Abstract: The task of concept prerequisite chain learning is to automatically determine the existence of prerequisite relationships among concept pairs. In this paper, we frame learning prerequisite relationships among concepts as an unsupervised task with no access to labeled concept pairs during training. We propose a model called the Relational-Variational Graph AutoEncoder (R-VGAE) to predict concept re… ▽ More

    Submitted 22 April, 2020; originally announced April 2020.

    Comments: 2 Figures, 3 Tables, 9 Pages

  38. arXiv:1910.14076  [pdf, other

    cs.CL

    A Neural Topic-Attention Model for Medical Term Abbreviation Disambiguation

    Authors: Irene Li, Michihiro Yasunaga, Muhammed Yavuz Nuzumlalı, Cesar Caraballo, Shiwani Mahajan, Harlan Krumholz, Dragomir Radev

    Abstract: Automated analysis of clinical notes is attracting increasing attention. However, there has not been much work on medical term abbreviation disambiguation. Such abbreviations are abundant, and highly ambiguous, in clinical documents. One of the main obstacles is the lack of large scale, balance labeled data sets. To address the issue, we propose a few-shot learning approach to take advantage of li… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

  39. arXiv:1909.01716  [pdf, other

    cs.CL cs.IR cs.LG

    ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks

    Authors: Michihiro Yasunaga, Jungo Kasai, Rui Zhang, Alexander R. Fabbri, Irene Li, Dan Friedman, Dragomir R. Radev

    Abstract: Scientific article summarization is challenging: large, annotated corpora are not available, and the summary should ideally include the article's impacts on research community. This paper provides novel solutions to these two challenges. We 1) develop and release the first large-scale manually-annotated corpus for scientific papers (on computational linguistics) by enabling faster annotation, and… ▽ More

    Submitted 15 September, 2019; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: AAAI 2019

  40. arXiv:1906.02285  [pdf, other

    cs.CL cs.AI

    SParC: Cross-Domain Semantic Parsing in Context

    Authors: Tao Yu, Rui Zhang, Michihiro Yasunaga, Yi Chern Tan, Xi Victoria Lin, Suyi Li, Heyang Er, Irene Li, Bo Pang, Tao Chen, Emily Ji, Shreya Dixit, David Proctor, Sungrok Shim, Jonathan Kraft, Vincent Zhang, Caiming Xiong, Richard Socher, Dragomir Radev

    Abstract: We present SParC, a dataset for cross-domainSemanticParsing inContext that consists of 4,298 coherent question sequences (12k+ individual questions annotated with SQL queries). It is obtained from controlled user interactions with 200 complex databases over 138 domains. We provide an in-depth analysis of SParC and show that it introduces new challenges compared to existing datasets. SParC demonstr… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: Accepted to ACL 2019, long paper

  41. arXiv:1906.01749  [pdf, other

    cs.CL

    Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model

    Authors: Alexander R. Fabbri, Irene Li, Tianwei She, Suyi Li, Dragomir R. Radev

    Abstract: Automatic generation of summaries from multiple news articles is a valuable tool as the number of online publications grows rapidly. Single document summarization (SDS) systems have benefited from advances in neural encoder-decoder model thanks to the availability of large datasets. However, multi-document summarization (MDS) of news articles has been limited to datasets of a couple of hundred exa… ▽ More

    Submitted 19 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: ACL 2019, 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 2019

  42. arXiv:1811.12181  [pdf, other

    cs.CY cs.CL cs.IR cs.LG stat.ML

    What Should I Learn First: Introducing LectureBank for NLP Education and Prerequisite Chain Learning

    Authors: Irene Li, Alexander R. Fabbri, Robert R. Tung, Dragomir R. Radev

    Abstract: Recent years have witnessed the rising popularity of Natural Language Processing (NLP) and related fields such as Artificial Intelligence (AI) and Machine Learning (ML). Many online courses and resources are available even for those without a strong background in the field. Often the student is curious about a specific topic but does not quite know where to begin studying. To answer the question o… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

  43. arXiv:1809.08887  [pdf, other

    cs.CL cs.AI

    Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task

    Authors: Tao Yu, Rui Zhang, Kai Yang, Michihiro Yasunaga, Dongxu Wang, Zifan Li, James Ma, Irene Li, Qingning Yao, Shanelle Roman, Zilin Zhang, Dragomir Radev

    Abstract: We present Spider, a large-scale, complex and cross-domain semantic parsing and text-to-SQL dataset annotated by 11 college students. It consists of 10,181 questions and 5,693 unique complex SQL queries on 200 databases with multiple tables, covering 138 different domains. We define a new complex and cross-domain semantic parsing and text-to-SQL task where different complex SQL queries and databas… ▽ More

    Submitted 2 February, 2019; v1 submitted 24 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018, Long Paper

  44. arXiv:1805.04617  [pdf, other

    cs.CL

    TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation

    Authors: Alexander R. Fabbri, Irene Li, Prawat Trairatvorakul, Yijiao He, Wei Tai Ting, Robert Tung, Caitlin Westerfield, Dragomir R. Radev

    Abstract: The field of Natural Language Processing (NLP) is growing rapidly, with new research published daily along with an abundance of tutorials, codebases and other online resources. In order to learn this dynamic field or stay up-to-date on the latest research, students as well as educators and researchers must constantly sift through multiple sources to find valuable, relevant information. To address… ▽ More

    Submitted 11 May, 2018; originally announced May 2018.

    Comments: ACL 2018, 56th Annual Meeting of the Association for Computational Linguistics, Melbourne, Australia, 2018

  45. arXiv:1704.06841  [pdf

    cs.CL

    Medical Text Classification using Convolutional Neural Networks

    Authors: Mark Hughes, Irene Li, Spyros Kotoulas, Toyotaro Suzumura

    Abstract: We present an approach to automatically classify clinical text at a sentence level. We are using deep convolutional neural networks to represent complex features. We train the network on a dataset providing a broad categorization of health information. Through a detailed evaluation, we demonstrate that our method outperforms several approaches widely used in natural language processing tasks by ab… ▽ More

    Submitted 22 April, 2017; originally announced April 2017.