Showing 1–48 of 48 results for author: Downey, D

  1. arXiv:2406.07835 [pdf, other]

    cs.CL cs.AI

    SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

    Authors: David Wadden, Kejian Shi, Jacob Morrison, Aakanksha Naik, Shruti Singh, Nitzan Barzilay, Kyle Lo, Tom Hope, Luca Soldaini, Shannon Zejiang Shen, Doug Downey, Hannaneh Hajishirzi, Arman Cohan

    Abstract: We present SciRIFF (Scientific Resource for Instruction-Following and Finetuning), a dataset of 137K instruction-following demonstrations for 54 tasks covering five essential scientific literature understanding capabilities: information extraction, summarization, question answering, claim verification, and classification. SciRIFF demonstrations are notable for their long input contexts, detailed t…

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Submitted to NeurIPS Datasets and Benchmarks 2024

  2. arXiv:2405.01796 [pdf, other]

    cs.CL cs.DL cs.IR

    TOPICAL: TOPIC Pages AutomagicaLly

    Authors: John Giorgi, Amanpreet Singh, Doug Downey, Sergey Feldman, Lucy Lu Wang

    Abstract: Topic pages aggregate useful information about an entity or concept into a single succinct and accessible article. Automated creation of topic pages would enable their rapid curation as information resources, providing an alternative to traditional web search. While most prior work has focused on generating topic pages about biographical entities, in this work, we develop a completely automated pr…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 10 pages, 7 figures, 2 tables, NAACL System Demonstrations 2024

  3. arXiv:2401.04259 [pdf, other]

    cs.CL

    MARG: Multi-Agent Review Generation for Scientific Papers

    Authors: Mike D'Arcy, Tom Hope, Larry Birnbaum, Doug Downey

    Abstract: We study the ability of LLMs to generate feedback for scientific papers and develop MARG, a feedback generation approach using multiple LLM instances that engage in internal discussion. By distributing paper text across agents, MARG can consume the full text of papers beyond the input length limitations of the base LLM, and by specializing agents and incorporating sub-tasks tailored to different c…

    Submitted 8 January, 2024; originally announced January 2024.

  4. arXiv:2311.11301 [pdf, other]

    cs.CL

    CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies

    Authors: Arie Cattan, Tom Hope, Doug Downey, Roy Bar-Haim, Lilach Eden, Yoav Kantor, Ido Dagan

    Abstract: Various NLP tasks require a complex hierarchical structure over nodes, where each node is a cluster of items. Examples include generating entailment graphs, hierarchical cross-document coreference resolution, annotating event and subevent relations, etc. To enable efficient annotation of such hierarchical structures, we release CHAMP, an open source tool allowing to incrementally construct both cl…

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  5. arXiv:2311.09736 [pdf, other]

    cs.CL

    CARE: Extracting Experimental Findings From Clinical Literature

    Authors: Aakanksha Naik, Bailey Kuehl, Erin Bransom, Doug Downey, Tom Hope

    Abstract: Extracting fine-grained experimental findings from literature can provide dramatic utility for scientific applications. Prior work has developed annotation schemas and datasets for limited aspects of this problem, failing to capture the real-world complexity and nuance required. Focusing on biomedicine, this work presents CARE -- a new IE dataset for the task of extracting clinical findings. We de…

    Submitted 24 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: To appear at NAACL Findings 2024

  6. arXiv:2306.12587 [pdf, other]

    cs.CL

    ARIES: A Corpus of Scientific Paper Edits Made in Response to Peer Reviews

    Authors: Mike D'Arcy, Alexis Ross, Erin Bransom, Bailey Kuehl, Jonathan Bragg, Tom Hope, Doug Downey

    Abstract: Revising scientific papers based on peer feedback is a challenging task that requires not only deep scientific knowledge and reasoning, but also the ability to recognize the implicit requests in high-level feedback and to choose the best of many possible ways to update the manuscript in response. We introduce this task for large language models and release ARIES, a dataset of review comments and t…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 11 pages, 2 figures

  7. arXiv:2306.01058 [pdf, other]

    cs.CL

    Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents

    Authors: Catherine Chen, Zejiang Shen, Dan Klein, Gabriel Stanovsky, Doug Downey, Kyle Lo

    Abstract: Recent work has shown that infusing layout features into language models (LMs) improves processing of visually-rich documents such as scientific papers. Layout-infused LMs are often evaluated on documents with familiar layout features (e.g., papers from the same publisher), but in practice models encounter documents with unfamiliar distributions of layout features, such as new combinations of text…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: To appear in ACL Findings 2023

  8. arXiv:2305.14259 [pdf, other]

    cs.CL cs.AI cs.LG

    SciMON: Scientific Inspiration Machines Optimized for Novelty

    Authors: Qingyun Wang, Doug Downey, Heng Ji, Tom Hope

    Abstract: We explore and enhance the ability of neural language models to generate novel scientific directions grounded in literature. Work on literature-based hypothesis generation has traditionally focused on binary link prediction--severely limiting the expressivity of hypotheses. This line of work also does not focus on optimizing novelty. We take a dramatic departure with a novel setting in which model…

    Submitted 3 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 21 pages. Code and resource are available at https://github.com/EagleW/CLBD Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

  9. arXiv:2305.00366 [pdf, other]

    cs.CL cs.IR cs.LG

    S2abEL: A Dataset for Entity Linking from Scientific Tables

    Authors: Yuze Lou, Bailey Kuehl, Erin Bransom, Sergey Feldman, Aakanksha Naik, Doug Downey

    Abstract: Entity linking (EL) is the task of linking a textual mention to its corresponding entry in a knowledge base, and is critical for many knowledge-intensive NLP applications. When applied to tables in scientific papers, EL is a step toward large-scale scientific knowledge bases that could enable advanced scientific question answering and analytics. We present the first dataset for EL in scientific ta…

    Submitted 29 April, 2023; originally announced May 2023.

  10. arXiv:2304.02623 [pdf, other]

    cs.CL cs.HC

    Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks

    Authors: Zejiang Shen, Tal August, Pao Siangliulue, Kyle Lo, Jonathan Bragg, Jeff Hammerbacher, Doug Downey, Joseph Chee Chang, David Sontag

    Abstract: Large language models have introduced exciting new opportunities and challenges in designing and developing new AI-assisted writing support tools. Recent work has shown that leveraging this new technology can transform writing in many scenarios such as ideation during creative writing, editing support, and summarization. However, AI-supported expository writing--including real-world tasks like sch…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 3 pages, 1 figure, accepted by The Second Workshop on Intelligent and Interactive Writing Assistants

  11. arXiv:2303.14334 [pdf, other]

    cs.HC cs.AI cs.CL

    The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

    Authors: Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, et al. (30 additional authors not shown)

    Abstract: Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the need for new technology to support the reading process grows. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has chan…

    Submitted 23 April, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

  12. CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context

    Authors: Joseph Chee Chang, Amy X. Zhang, Jonathan Bragg, Andrew Head, Kyle Lo, Doug Downey, Daniel S. Weld

    Abstract: When reading a scholarly article, inline citations help researchers contextualize the current article and discover relevant prior work. However, it can be challenging to prioritize and make sense of the hundreds of citations encountered during literature reviews. This paper introduces CiteSee, a paper reading tool that leverages a user's publishing, reading, and saving activities to provide person…

    Submitted 14 February, 2023; originally announced February 2023.

  13. arXiv:2302.06754 [pdf, other]

    cs.HC cs.DL cs.IR

    Relatedly: Scaffolding Literature Reviews with Existing Related Work Sections

    Authors: Srishti Palani, Aakanksha Naik, Doug Downey, Amy X. Zhang, Jonathan Bragg, Joseph Chee Chang

    Abstract: Scholars who want to research a scientific topic must take time to read, extract meaning, and identify connections across many papers. As scientific literature grows, this becomes increasingly challenging. Meanwhile, authors summarize prior research in papers' related work sections, though this is scoped to support a single paper. A formative study found that while reading multiple related work pa…

    Submitted 13 February, 2023; originally announced February 2023.

  14. arXiv:2301.10140 [pdf, other]

    cs.DL cs.CL

    The Semantic Scholar Open Data Platform

    Authors: Rodney Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Arman Cohan, Miles Crawford, Doug Downey, Jason Dunkelberger, Oren Etzioni, Rob Evans, Sergey Feldman, Joseph Gorney, David Graham, Fangzhou Hu, Regan Huff, Daniel King, Sebastian Kohlmeier, Bailey Kuehl, Michael Langan, Daniel Lin, et al. (23 additional authors not shown)

    Abstract: The volume of scientific output is creating an urgent need for automated tools to help scientists keep up with developments in their field. Semantic Scholar (S2) is an open data platform and website aimed at accelerating science by helping scholars discover and understand scientific literature. We combine public and proprietary data sources using state-of-the-art techniques for scholarly PDF conte…

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 8 pages, 6 figures

  15. arXiv:2212.09246 [pdf, other]

    cs.CL

    I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation

    Authors: Chandra Bhagavatula, Jena D. Hwang, Doug Downey, Ronan Le Bras, Ximing Lu, Lianhui Qin, Keisuke Sakaguchi, Swabha Swayamdipta, Peter West, Yejin Choi

    Abstract: Commonsense capabilities of pre-trained language models dramatically improve with scale, leading many to believe that scale is the only winning recipe. But is it? Here, we investigate an alternative that a priori seems impossible: can smaller language models (e.g., GPT-2) win over models that are orders of magnitude larger and better (e.g., GPT-3), if powered with novel commonsense distillation al…

    Submitted 26 May, 2023; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  16. arXiv:2211.13308 [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    SciRepEval: A Multi-Format Benchmark for Scientific Document Representations

    Authors: Amanpreet Singh, Mike D'Arcy, Arman Cohan, Doug Downey, Sergey Feldman

    Abstract: Learned representations of scientific documents can serve as valuable input features for downstream tasks without further fine-tuning. However, existing benchmarks for evaluating these representations fail to capture the diversity of relevant tasks. In response, we introduce SciRepEval, the first comprehensive benchmark for training and evaluating scientific document representations. It includes 2…

    Submitted 13 November, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: 19 pages, 2 figures, 11 tables. Accepted in EMNLP 2023 Main Conference

  17. arXiv:2210.12607 [pdf, other]

    cs.CL cs.AI cs.LG

    Learning to Perform Complex Tasks through Compositional Fine-Tuning of Language Models

    Authors: Victor S. Bursztyn, David Demeter, Doug Downey, Larry Birnbaum

    Abstract: How to usefully encode compositional task structure has long been a core challenge in AI. Recent work in chain of thought prompting has shown that for very large neural language models (LMs), explicitly demonstrating the inferential steps involved in a target task may improve performance over end-to-end learning that focuses on the target task alone. However, chain of thought prompting has signifi…

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: Accepted to findings of EMNLP 2022. Data and code available at https://github.com/vbursztyn/compositional-fine-tuning

  18. FeedLens: Polymorphic Lenses for Personalizing Exploratory Search over Knowledge Graphs

    Authors: Harmanpreet Kaur, Doug Downey, Amanpreet Singh, Evie Yu-Yen Cheng, Daniel S. Weld, Jonathan Bragg

    Abstract: The vast scale and open-ended nature of knowledge graphs (KGs) make exploratory search over them cognitively demanding for users. We introduce a new technique, polymorphic lenses, that improves exploratory search over a KG by obtaining new leverage from the existing preference models that KG-based systems maintain for recommending content. The approach is based on a simple but powerful observation…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: To appear at UIST 2022

  19. arXiv:2207.04993 [pdf, other]

    cs.CL

    Embedding Recycling for Language Models

    Authors: Jon Saad-Falcon, Amanpreet Singh, Luca Soldaini, Mike D'Arcy, Arman Cohan, Doug Downey

    Abstract: Real-world applications of neural language models often involve running many different models over the same corpus. The high computational cost of these runs has led to interest in techniques that can reuse the contextualized embeddings produced in previous runs to speed training and inference of future ones. We refer to this approach as embedding recycling (ER). While multiple ER techniques have…

    Submitted 30 January, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: EACL Findings 2023

  20. arXiv:2206.10883 [pdf, other]

    cs.CL cs.CY

    Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities

    Authors: Zejiang Shen, Kyle Lo, Lauren Yu, Nathan Dahlberg, Margo Schlanger, Doug Downey

    Abstract: With the advent of large language models, methods for abstractive summarization have made great strides, creating potential for use in applications to aid knowledge workers processing unwieldy document collections. One such setting is the Civil Rights Litigation Clearinghouse (CRLC) (https://clearinghouse.net), which posts information about large-scale civil rights lawsuits, serving lawyers, schola…

    Submitted 22 July, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: 37 pages, 2 figures, 9 tables

  21. arXiv:2205.11658 [pdf, other]

    cs.CL

    Penguins Don't Fly: Reasoning about Generics through Instantiations and Exceptions

    Authors: Emily Allaway, Jena D. Hwang, Chandra Bhagavatula, Kathleen McKeown, Doug Downey, Yejin Choi

    Abstract: Generics express generalizations about the world (e.g., birds can fly) that are not universally true (e.g., newborn birds and penguins cannot fly). Commonsense knowledge bases, used extensively in NLP, encode some generic knowledge but rarely enumerate such exceptions and knowing when a generic statement holds or does not hold true is crucial for developing a comprehensive understanding of generic…

    Submitted 24 March, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: EACL 2023

  22. arXiv:2205.08012 [pdf, other]

    cs.CL cs.AI cs.LG

    CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction

    Authors: Tara Safavi, Doug Downey, Tom Hope

    Abstract: Knowledge graph (KG) link prediction is a fundamental task in artificial intelligence, with applications in natural language processing, information retrieval, and biomedicine. Recently, promising results have been achieved by leveraging cross-modal information in KGs, using ensembles that combine knowledge graph embeddings (KGEs) and contextual language models (LMs). However, existing ensembles a…

    Submitted 23 September, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

    Comments: AKBC 2022

  23. arXiv:2205.06982 [pdf, other]

    cs.CL cs.AI cs.HC

    ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts

    Authors: Sonia K. Murthy, Kyle Lo, Daniel King, Chandra Bhagavatula, Bailey Kuehl, Sophie Johnson, Jonathan Borchardt, Daniel S. Weld, Tom Hope, Doug Downey

    Abstract: Systems that can automatically define unfamiliar terms hold the promise of improving the accessibility of scientific texts, especially for readers who may lack prerequisite background knowledge. However, current systems assume a single "best" description per concept, which fails to account for the many potentially useful ways a concept can be described. We present ACCoRD, an end-to-end system tack…

    Submitted 14 May, 2022; originally announced May 2022.

  24. arXiv:2205.02007 [pdf, other]

    cs.CL cs.CY cs.HC cs.IR

    A Computational Inflection for Scientific Discovery

    Authors: Tom Hope, Doug Downey, Oren Etzioni, Daniel S. Weld, Eric Horvitz

    Abstract: We stand at the foot of a significant inflection in the trajectory of scientific discovery. As society continues on its fast-paced digital transformation, so does humankind's collective scientific knowledge and discourse. We now read and write papers in digitized form, and a great deal of the formal and informal processes of science are captured digitally -- including papers, preprints and books,…

    Submitted 24 May, 2023; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted to CACM

  25. arXiv:2204.10838 [pdf, other]

    cs.DL cs.CY cs.SI

    S2AMP: A High-Coverage Dataset of Scholarly Mentorship Inferred from Publications

    Authors: Shaurya Rohatgi, Doug Downey, Daniel King, Sergey Feldman

    Abstract: Mentorship is a critical component of academia, but is not as visible as publications, citations, grants, and awards. Despite the importance of studying the quality and impact of mentorship, there are few large representative mentorship datasets available. We contribute two datasets to the study of mentorship. The first has over 300,000 ground truth academic mentor-mentee pairs obtained from multi…

    Submitted 29 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Journal ref: The ACM/IEEE Joint Conference on Digital Libraries in 2022 (JCDL '22), June 20-24, 2022, Cologne, Germany

  26. arXiv:2204.10254 [pdf, other]

    cs.IR cs.HC cs.SI

    From Who You Know to What You Read: Augmenting Scientific Recommendations with Implicit Social Networks

    Authors: Hyeonsu B. Kang, Rafal Kocielnik, Andrew Head, Jiangjiang Yang, Matt Latzke, Aniket Kittur, Daniel S. Weld, Doug Downey, Jonathan Bragg

    Abstract: The ever-increasing pace of scientific publication necessitates methods for quickly identifying relevant papers. While neural recommenders trained on user interests can help, they still result in long, monotonous lists of suggested papers. To improve the discovery experience we introduce multiple new methods for augmenting recommendations with textual relevance messages that highlight knowledg…

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: to be published in ACM SIGCHI 2022

  27. arXiv:2203.08436 [pdf, other]

    cs.CL

    Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search

    Authors: Daniel King, Zejiang Shen, Nishant Subramani, Daniel S. Weld, Iz Beltagy, Doug Downey

    Abstract: Abstractive summarization systems today produce fluent and relevant output, but often "hallucinate" statements not supported by the source text. We analyze the connection between hallucinations and training data, and find evidence that models hallucinate because they train on target summaries that are unsupported by the source. Based on our findings, we present PINOCCHIO, a new decoding method tha…

    Submitted 17 November, 2023; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: 16 pages, 2 figures, 7 tables

  28. arXiv:2111.08284 [pdf, other]

    cs.CL

    Few-Shot Self-Rationalization with Natural Language Prompts

    Authors: Ana Marasović, Iz Beltagy, Doug Downey, Matthew E. Peters

    Abstract: Self-rationalization models that predict task labels and generate free-text elaborations for their predictions could enable more intuitive interaction with NLP systems. These models are, however, currently trained with a large amount of human-written free-text explanations for each task which hinders their broader usage. We propose to study a more realistic setting of self-rationalization using fe…

    Submitted 25 April, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: v2: NAACL Findings 2022 accepted paper camera-ready version. First two authors contributed equally. 9 pages main, 3 pages appendix

  29. arXiv:2109.13301 [pdf, other]

    cs.IR cs.HC cs.LG

    Exploring The Role of Local and Global Explanations in Recommender Systems

    Authors: Marissa Radensky, Doug Downey, Kyle Lo, Zoran Popović, Daniel S. Weld

    Abstract: Explanations are well-known to improve recommender systems' transparency. These explanations may be local, explaining an individual recommendation, or global, explaining the recommender model in general. Despite their widespread use, there has been little investigation into the relative benefits of these two approaches. Do they provide the same benefits to users, or do they serve different purpose…

    Submitted 27 September, 2021; originally announced September 2021.

  30. arXiv:2109.07576 [pdf, other]

    cs.CL cs.AI

    "It doesn't look good for a date": Transforming Critiques into Preferences for Conversational Recommendation Systems

    Authors: Victor S. Bursztyn, Jennifer Healey, Nedim Lipka, Eunyee Koh, Doug Downey, Larry Birnbaum

    Abstract: Conversations aimed at determining good recommendations are iterative in nature. People often express their preferences in terms of a critique of the current recommendation (e.g., "It doesn't look good for a date"), requiring some degree of common sense for a preference to be inferred. In this work, we present a method for transforming a user critique into a positive preference (e.g., "I prefer mo…

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021's main conference

  31. arXiv:2106.00676 [pdf, other]

    cs.CL cs.CV

    VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups

    Authors: Zejiang Shen, Kyle Lo, Lucy Lu Wang, Bailey Kuehl, Daniel S. Weld, Doug Downey

    Abstract: Accurately extracting structured content from PDFs is a critical first step for NLP over scientific papers. Recent work has improved extraction accuracy by incorporating elementary layout information, e.g., each token's 2D position on the page, into language model pretraining. We introduce new methods that explicitly model VIsual LAyout (VILA) groups, i.e., text lines or text blocks, to further im…

    Submitted 5 January, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: To appear in TACL 2022. The arXiv version is a pre-MIT Press publication version. (17 pages, 5 figures, 9 tables)

  32. arXiv:2104.08809 [pdf, other]

    cs.CL cs.IR cs.LG

    SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts

    Authors: Arie Cattan, Sophie Johnson, Daniel Weld, Ido Dagan, Iz Beltagy, Doug Downey, Tom Hope

    Abstract: Determining coreference of concept mentions across multiple documents is a fundamental task in natural language understanding. Previous work on cross-document coreference resolution (CDCR) typically considers mentions of events in the news, which seldom involve abstract technical concepts that are prevalent in science and technology. These complex concepts take diverse or ambiguous forms and have…

    Submitted 1 September, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: Accepted to AKBC 2021. Data and code available at https://scico.apps.allenai.org/

  33. arXiv:2103.07534 [pdf, other]

    cs.DL

    S2AND: A Benchmark and Evaluation System for Author Name Disambiguation

    Authors: Shivashankar Subramanian, Daniel King, Doug Downey, Sergey Feldman

    Abstract: Author Name Disambiguation (AND) is the task of resolving which author mentions in a bibliographic database refer to the same real-world person, and is a critical ingredient of digital library applications such as search and citation analysis. While many AND algorithms have been proposed, comparing them is difficult because they often employ distinct features and are evaluated on different dataset…

    Submitted 21 February, 2022; v1 submitted 12 March, 2021; originally announced March 2021.

    Journal ref: JCDL 2021

  34. Simplified Data Wrangling with ir_datasets

    Authors: Sean MacAvaney, Andrew Yates, Sergey Feldman, Doug Downey, Arman Cohan, Nazli Goharian

    Abstract: Managing the data for Information Retrieval (IR) experiments can be challenging. Dataset documentation is scattered across the Internet and once one obtains a copy of the data, there are numerous different data formats to work with. Even basic formats can have subtle dataset-specific nuances that need to be considered for proper use. To help mitigate these challenges, we introduce a new robust and…

    Submitted 10 May, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: SIGIR 2021 Resource

  35. ABNIRML: Analyzing the Behavior of Neural IR Models

    Authors: Sean MacAvaney, Sergey Feldman, Nazli Goharian, Doug Downey, Arman Cohan

    Abstract: Pretrained contextualized language models such as BERT and T5 have established a new state-of-the-art for ad-hoc search. However, it is not yet well-understood why these methods are so effective, what makes some variants more effective than others, and what pitfalls they may have. We present a new comprehensive framework for Analyzing the Behavior of Neural IR ModeLs (ABNIRML), which includes new…

    Submitted 20 July, 2023; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: TACL version

  36. High-Precision Extraction of Emerging Concepts from Scientific Literature

    Authors: Daniel King, Doug Downey, Daniel S. Weld

    Abstract: Identification of new concepts in scientific literature can help power faceted search, scientific trend analysis, knowledge-base construction, and more, but current methods are lacking. Manual identification cannot keep up with the torrent of new publications, while the precision of existing automatic techniques is too low for many applications. We present an unsupervised concept extraction method…

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted to SIGIR 2020

    Journal ref: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (2020) 1549-1552

  37. arXiv:2005.02433 [pdf, other]

    cs.LG stat.ML

    Stolen Probability: A Structural Weakness of Neural Language Models

    Authors: David Demeter, Gregory Kimmel, Doug Downey

    Abstract: Neural Network Language Models (NNLMs) generate probability distributions by applying a softmax function to a distance metric formed by taking the dot product of a prediction vector with all word vectors in a high-dimensional embedding space. The dot-product distance metric forms part of the inductive bias of NNLMs. Although NNLMs optimize well with this inductive bias, we show that this results i…

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: Preprint of paper accepted for ACL-2020

  38. Generative Data Augmentation for Commonsense Reasoning

    Authors: Yiben Yang, Chaitanya Malaviya, Jared Fernandez, Swabha Swayamdipta, Ronan Le Bras, Ji-Ping Wang, Chandra Bhagavatula, Yejin Choi, Doug Downey

    Abstract: Recent advances in commonsense reasoning depend on large-scale human-annotated training data to achieve peak performance. However, manual curation of training examples is expensive and has been shown to introduce annotation artifacts that neural models can readily exploit and overfit on. We investigate G-DAUG^C, a novel generative data augmentation method that aims to achieve more accurate and rob…

    Submitted 16 November, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: Findings of the Association for Computational Linguistics: EMNLP 2020

  39. arXiv:2004.10964 [pdf, other]

    cs.CL cs.LG

    Don't Stop Pretraining: Adapt Language Models to Domains and Tasks

    Authors: Suchin Gururangan, Ana Marasović, Swabha Swayamdipta, Kyle Lo, Iz Beltagy, Doug Downey, Noah A. Smith

    Abstract: Language models pretrained on text from a wide variety of sources form the foundation of today's NLP. In light of the success of these broad-coverage models, we investigate whether it is still helpful to tailor a pretrained model to the domain of a target task. We present a study across four domains (biomedical and computer science publications, news, and reviews) and eight classification tasks, s…

    Submitted 5 May, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: ACL 2020

  40. arXiv:2004.07180 [pdf, other]

    cs.CL

    SPECTER: Document-level Representation Learning using Citation-informed Transformers

    Authors: Arman Cohan, Sergey Feldman, Iz Beltagy, Doug Downey, Daniel S. Weld

    Abstract: Representation learning is a critical ingredient for natural language processing systems. Recent Transformer language models like BERT learn powerful textual representations, but these models are targeted towards token- and sentence-level training objectives and do not leverage information on inter-document relatedness, which limits their document-level representation power. For applications on sc…

    Submitted 20 May, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: ACL 2020

  41. arXiv:2003.04315  [pdf, ps, other]

    cs.IR cs.LG stat.ML

    LIMEADE: From AI Explanations to Advice Taking

    Authors: Benjamin Charles Germain Lee, Doug Downey, Kyle Lo, Daniel S. Weld

    Abstract: Research in human-centered AI has shown the benefits of systems that can explain their predictions. Methods that allow an AI to take advice from humans in response to explanations are similarly useful. While both capabilities are well-developed for transparent learning models (e.g., linear models and GA$^2$Ms), and recent techniques (e.g., LIME and SHAP) can generate explanations for opaque models…

    Submitted 17 January, 2023; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: 18 pages, 7 figures

  42. arXiv:1912.05421  [pdf, other]

    cs.LG stat.ML

    Just Add Functions: A Neural-Symbolic Language Model

    Authors: David Demeter, Doug Downey

    Abstract: Neural network language models (NNLMs) have achieved ever-improving accuracy due to more sophisticated architectures and increasing amounts of training data. However, the inductive bias of these models (formed by the distributional hypothesis of language), while ideally suited to modeling most running text, results in key limitations for today's models. In particular, the models often struggle to…

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: Preprint of paper accepted for AAAI-2020

  43. arXiv:1909.09483  [pdf, other]

    cs.CL

    Multi-sense Definition Modeling using Word Sense Decompositions

    Authors: Ruimin Zhu, Thanapon Noraset, Alisa Liu, Wenxin Jiang, Doug Downey

    Abstract: Word embeddings capture syntactic and semantic information about words. Definition modeling aims to make the semantic content in each embedding explicit, by outputting a natural language definition based on the embedding. However, existing definition models are limited in their ability to generate accurate definitions for different senses of the same word. In this paper, we introduce a new method…

    Submitted 18 September, 2019; originally announced September 2019.

  44. arXiv:1908.05739  [pdf, other]

    cs.CL

    Abductive Commonsense Reasoning

    Authors: Chandra Bhagavatula, Ronan Le Bras, Chaitanya Malaviya, Keisuke Sakaguchi, Ari Holtzman, Hannah Rashkin, Doug Downey, Scott Wen-tau Yih, Yejin Choi

    Abstract: Abductive reasoning is inference to the most plausible explanation. For example, if Jenny finds her house in a mess when she returns from work, and remembers that she left a window open, she can hypothesize that a thief broke into her house and caused the mess, as the most plausible explanation. While abduction has long been considered to be at the core of how people interpret and read between the…

    Submitted 13 February, 2020; v1 submitted 15 August, 2019; originally announced August 2019.

    Comments: ICLR 2020 Camera Ready

  45. arXiv:1904.04365  [pdf, ps, other]

    cs.CL cs.LG

    CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense

    Authors: Michael Chen, Mike D'Arcy, Alisa Liu, Jared Fernandez, Doug Downey

    Abstract: Commonsense reasoning is a critical AI capability, but it is difficult to construct challenging datasets that test common sense. Recent neural question answering systems, based on large pre-trained models of language, have already achieved near-human-level performance on commonsense knowledge benchmarks. These systems do not possess human-level common sense, but are able to exploit limitations of…

    Submitted 26 July, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: 8 pages, Appeared in RepEval 2019

  46. arXiv:1901.09848  [pdf, other]

    cs.CL cs.LG physics.soc-ph

    A new evaluation framework for topic modeling algorithms based on synthetic corpora

    Authors: Hanyu Shi, Martin Gerlach, Isabel Diersen, Doug Downey, Luis A. N. Amaral

    Abstract: Topic models are in widespread use in natural language processing and beyond. Here, we propose a new framework for the evaluation of probabilistic topic modeling algorithms based on synthetic corpora containing an unambiguously defined ground truth topic structure. The major innovation of our approach is the ability to quantify the agreement between the planted and inferred topic structures by com…

    Submitted 28 January, 2019; originally announced January 2019.

    Comments: accepted for AISTATS 2019; code available at https://github.com/amarallab/synthetic_benchmark_topic_model; Main text (11 pages, 5 figures) and Supplementary Material (14 pages, 11 figures)

  47. arXiv:1805.02262  [pdf, other]

    cs.CL

    Construction of the Literature Graph in Semantic Scholar

    Authors: Waleed Ammar, Dirk Groeneveld, Chandra Bhagavatula, Iz Beltagy, Miles Crawford, Doug Downey, Jason Dunkelberger, Ahmed Elgohary, Sergey Feldman, Vu Ha, Rodney Kinney, Sebastian Kohlmeier, Kyle Lo, Tyler Murray, Hsu-Han Ooi, Matthew Peters, Joanna Power, Sam Skjonsberg, Lucy Lu Wang, Chris Wilhelm, Zheng Yuan, Madeleine van Zuylen, Oren Etzioni

    Abstract: We describe a deployed scalable system for organizing published scientific literature into a heterogeneous graph to facilitate algorithmic manipulation and discovery. The resulting literature graph consists of more than 280M nodes, representing papers, authors, entities and various interactions between them (e.g., authorships, citations, entity mentions). We reduce literature graph construction in…

    Submitted 6 May, 2018; originally announced May 2018.

    Comments: To appear in NAACL 2018 industry track

  48. arXiv:1612.00394  [pdf, ps, other]

    cs.CL

    Definition Modeling: Learning to define word embeddings in natural language

    Authors: Thanapon Noraset, Chen Liang, Larry Birnbaum, Doug Downey

    Abstract: Distributed representations of words have been shown to capture lexical semantics, as demonstrated by their effectiveness in word similarity and analogical relation tasks. But, these tasks only evaluate lexical semantics indirectly. In this paper, we study whether it is possible to utilize distributed representations to generate dictionary definitions of words, as a more direct and transparent rep…

    Submitted 1 December, 2016; originally announced December 2016.

    Comments: To appear in AAAI Conference 2017