Skip to main content

Showing 1–8 of 8 results for author: Hogan, W R

  1. arXiv:2405.00186  [pdf

    cs.AI cs.DB cs.IR

    Credentials in the Occupation Ontology

    Authors: John Beverley, Robin McGill, Sam Smith, Jie Zheng, Giacomo De Colle, Finn Wilson, Matthew Diller, William D. Duncan, William R. Hogan, Yongqun He

    Abstract: The term credential encompasses educational certificates, degrees, certifications, and government-issued licenses. An occupational credential is a verification of an individuals qualification or competence issued by a third party with relevant authority. Job seekers often leverage such credentials as evidence that desired qualifications are satisfied by their holders. Many U.S. education and workf… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 11

  2. A Study of Generative Large Language Model for Medical Research and Healthcare

    Authors: Cheng Peng, Xi Yang, Aokun Chen, Kaleb E Smith, Nima PourNejatian, Anthony B Costa, Cheryl Martin, Mona G Flores, Ying Zhang, Tanja Magoc, Gloria Lipori, Duane A Mitchell, Naykky S Ospina, Mustafa M Ahmed, William R Hogan, Elizabeth A Shenkman, Yi Guo, Jiang Bian, Yonghui Wu

    Abstract: There is enormous enthusiasm and concerns in using large language models (LLMs) in healthcare, yet current assumptions are all based on general-purpose LLMs such as ChatGPT. This study develops a clinical generative LLM, GatorTronGPT, using 277 billion words of mixed clinical and English text with a GPT-3 architecture of 20 billion parameters. GatorTronGPT improves biomedical natural language proc… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  3. Clinical Concept and Relation Extraction Using Prompt-based Machine Reading Comprehension

    Authors: Cheng Peng, Xi Yang, Zehao Yu, Jiang Bian, William R. Hogan, Yonghui Wu

    Abstract: Objective: To develop a natural language processing system that solves both clinical concept extraction and relation extraction in a unified prompt-based machine reading comprehension (MRC) architecture with good generalizability for cross-institution applications. Methods: We formulate both clinical concept extraction and relation extraction using a unified prompt-based MRC architecture and exp… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  4. arXiv:2212.03000  [pdf

    cs.CL cs.AI cs.LG

    SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies

    Authors: Zehao Yu, Xi Yang, Chong Dang, Prakash Adekkanattu, Braja Gopal Patra, Yifan Peng, Jyotishman Pathak, Debbie L. Wilson, Ching-Yuan Chang, Wei-Hsuan Lo-Ciganic, Thomas J. George, William R. Hogan, Yi Guo, Jiang Bian, Yonghui Wu

    Abstract: Objective: We aim to develop an open-source natural language processing (NLP) package, SODA (i.e., SOcial DeterminAnts), with pre-trained transformer models to extract social determinants of health (SDoH) for cancer patients, examine the generalizability of SODA to a new disease domain (i.e., opioid use), and evaluate the extraction rate of SDoH using cancer populations. Methods: We identified S… ▽ More

    Submitted 18 May, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    ACM Class: I.2.7

    Journal ref: Journal of Biomedical Informatics, April 2024, 104642

  5. Ontology Development Kit: a toolkit for building, maintaining, and standardising biomedical ontologies

    Authors: Nicolas Matentzoglu, Damien Goutte-Gattat, Shawn Zheng Kai Tan, James P. Balhoff, Seth Carbon, Anita R. Caron, William D. Duncan, Joe E. Flack, Melissa Haendel, Nomi L. Harris, William R Hogan, Charles Tapley Hoyt, Rebecca C. Jackson, HyeongSik Kim, Huseyin Kir, Martin Larralde, Julie A. McMurry, James A. Overton, Bjoern Peters, Clare Pilgrim, Ray Stefancsik, Sofia MC Robb, Sabrina Toro, Nicole A Vasilevsky, Ramona Walls , et al. (2 additional authors not shown)

    Abstract: Similar to managing software packages, managing the ontology life cycle involves multiple complex workflows such as preparing releases, continuous quality control checking, and dependency management. To manage these processes, a diverse set of tools is required, from command line utilities to powerful ontology engineering environments such as ROBOT. Particularly in the biomedical domain, which has… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: 19 pages, 2 supplementary tables, 1 supplementary figure

  6. arXiv:2203.03540  [pdf

    cs.CL cs.AI cs.LG

    GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records

    Authors: Xi Yang, Aokun Chen, Nima PourNejatian, Hoo Chang Shin, Kaleb E Smith, Christopher Parisien, Colin Compas, Cheryl Martin, Mona G Flores, Ying Zhang, Tanja Magoc, Christopher A Harle, Gloria Lipori, Duane A Mitchell, William R Hogan, Elizabeth A Shenkman, Jiang Bian, Yonghui Wu

    Abstract: There is an increasing interest in developing artificial intelligence (AI) systems to process and interpret electronic health records (EHRs). Natural language processing (NLP) powered by pretrained language models is the key technology for medical AI systems utilizing clinical narratives. However, there are few clinical language models, the largest of which trained in the clinical domain is compar… ▽ More

    Submitted 16 December, 2022; v1 submitted 2 February, 2022; originally announced March 2022.

    Comments: 24 pages, 2 figures, 3 tables

  7. arXiv:2108.04949  [pdf

    cs.CL cs.LG

    A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models

    Authors: Zehao Yu, Xi Yang, Chong Dang, Songzi Wu, Prakash Adekkanattu, Jyotishman Pathak, Thomas J. George, William R. Hogan, Yi Guo, Jiang Bian, Yonghui Wu

    Abstract: Social and behavioral determinants of health (SBDoH) have important roles in shaping people's health. In clinical research studies, especially comparative effectiveness studies, failure to adjust for SBDoH factors will potentially cause confounding issues and misclassification errors in either statistical analyses and machine learning-based models. However, there are limited studies to examine SBD… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: 9 pages; 2 figures, 4 tables, AMIA 2021

  8. arXiv:1910.00582  [pdf

    q-bio.QM cs.LG stat.ML

    Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods

    Authors: Xi Yang, Yan Gong, Nida Waheed, Keith March, Jiang Bian, William R. Hogan, Yonghui Wu

    Abstract: Cardiotoxicity related to cancer therapies has become a serious issue, diminishing cancer treatment outcomes and quality of life. Early detection of cancer patients at risk for cardiotoxicity before cardiotoxic treatments and providing preventive measures are potential solutions to improve cancer patients's quality of life. This study focuses on predicting the development of heart failure in cance… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: 6 pages, 1 figure, 3 tables, accepted by AMIA 2019

    Journal ref: AMIA Annu Symp Proc (2019) 933-941