Skip to main content

Showing 1–6 of 6 results for author: Drake, B

  1. arXiv:2312.14129  [pdf, other

    cs.LG cs.AI cs.IR

    WellFactor: Patient Profiling using Integrative Embedding of Healthcare Data

    Authors: Dongjin Choi, Andy Xiang, Ozgur Ozturk, Deep Shrestha, Barry Drake, Hamid Haidarian, Faizan Javed, Haesun Park

    Abstract: In the rapidly evolving healthcare industry, platforms now have access to not only traditional medical records, but also diverse data sets encompassing various patient interactions, such as those from healthcare web portals. To address this rich diversity of data, we introduce WellFactor: a method that derives patient profiles by integrating information from these sources. Central to our approach… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 2023 IEEE International Conference on Big Data (IEEE BigData 2023)

  2. Patient Clustering via Integrated Profiling of Clinical and Digital Data

    Authors: Dongjin Choi, Andy Xiang, Ozgur Ozturk, Deep Shrestha, Barry Drake, Hamid Haidarian, Faizan Javed, Haesun Park

    Abstract: We introduce a novel profile-based patient clustering model designed for clinical data in healthcare. By utilizing a method grounded on constrained low-rank approximation, our model takes advantage of patients' clinical data and digital interaction data, including browsing and search, to construct patient profiles. As a result of the method, nonnegative embedding vectors are generated, serving as… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted for the Short Paper track of CIKM'23, October 21-25, 2023, Birmingham, United Kingdom

  3. arXiv:2205.09488  [pdf

    cs.SE cs.LG cs.NI

    PSI Draft Specification

    Authors: Mark Reid, James Montgomery, Barry Drake, Avraham Ruderman

    Abstract: This document presents the draft specification for delivering machine learning services over HTTP, developed as part of the Protocols and Structures for Inference project, which concluded in 2013. It presents the motivation for providing machine learning as a service, followed by a description of the essential and optional components of such a service.

    Submitted 1 May, 2022; originally announced May 2022.

    Comments: Software specification for PSI machine learning web services. 42 pages, 2 figures

  4. arXiv:1907.12079  [pdf, other

    cs.IR cs.HC

    TopicSifter: Interactive Search Space Reduction Through Targeted Topic Modeling

    Authors: Hannah Kim, Dongjin Choi, Barry Drake, Alex Endert, Haesun Park

    Abstract: Topic modeling is commonly used to analyze and understand large document collections. However, in practice, users want to focus on specific aspects or "targets" rather than the entire corpus. For example, given a large collection of documents, users may want only a smaller subset which more closely aligns with their interests, tasks, and domains. In particular, our paper focuses on large-scale doc… ▽ More

    Submitted 28 July, 2019; originally announced July 2019.

  5. arXiv:1703.09646  [pdf, other

    cs.LG stat.ML

    Hybrid Clustering based on Content and Connection Structure using Joint Nonnegative Matrix Factorization

    Authors: Rundong Du, Barry Drake, Haesun Park

    Abstract: We present a hybrid method for latent information discovery on the data sets containing both text content and connection structure based on constrained low rank approximation. The new method jointly optimizes the Nonnegative Matrix Factorization (NMF) objective function for text clustering and the Symmetric NMF (SymNMF) objective function for graph clustering. We propose an effective algorithm for… ▽ More

    Submitted 28 March, 2017; originally announced March 2017.

    Comments: 9 pages, Submitted to a conference, Feb. 2017

  6. arXiv:1509.01208   

    cs.LG cs.IR math.NA

    Fast Clustering and Topic Modeling Based on Rank-2 Nonnegative Matrix Factorization

    Authors: Da Kuang, Barry Drake, Haesun Park

    Abstract: The importance of unsupervised clustering and topic modeling is well recognized with ever-increasing volumes of text data. In this paper, we propose a fast method for hierarchical clustering and topic modeling called HierNMF2. Our method is based on fast Rank-2 nonnegative matrix factorization (NMF) that performs binary clustering and an efficient node splitting rule. Further utilizing the final l… ▽ More

    Submitted 2 October, 2015; v1 submitted 3 September, 2015; originally announced September 2015.

    Comments: This paper has been withdrawn by the author to clarify the authorship

    ACM Class: F.2.1; H.3.3