Skip to main content

Showing 1–50 of 70 results for author: Brown, E

  1. arXiv:2407.00541  [pdf

    cs.CL cs.AI cs.IR

    Answering real-world clinical questions using large language model based systems

    Authors: Yen Sia Low, Michael L. Jackson, Rebecca J. Hyde, Robert E. Brown, Neil M. Sanghavi, Julian D. Baldwin, C. William Pike, Jananee Muralidharan, Gavin Hui, Natasha Alexander, Hadeel Hassan, Rahul V. Nene, Morgan Pike, Courtney J. Pokrzywa, Shivam Vedak, Adam Paul Yan, Dong-han Yao, Amy R. Zipursky, Christina Dinh, Philip Ballentine, Dan C. Derieg, Vladimir Polony, Rehan N. Chawdry, Jordan Davies, Brigham B. Hyde , et al. (2 additional authors not shown)

    Abstract: Evidence to guide healthcare decisions is often limited by a lack of relevant and trustworthy literature as well as difficulty in contextualizing existing research for a specific patient. Large language models (LLMs) could potentially address both challenges by either summarizing published literature or generating new studies based on real-world data (RWD). We evaluated the ability of five LLM-bas… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 28 pages (2 figures, 3 tables) inclusive of 8 pages of supplemental materials (4 supplemental figures and 4 supplemental tables)

  2. arXiv:2406.19225  [pdf, other

    cs.CV

    ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation

    Authors: Nazanin Moradinasab, Laura S. Shankman, Rebecca A. Deaton, Gary K. Owens, Donald E. Brown

    Abstract: Domain adaptive semantic segmentation aims to generate accurate and dense predictions for an unlabeled target domain by leveraging a supervised model trained on a labeled source domain. The prevalent self-training approach involves retraining the dense discriminative classifier of $p(class|pixel feature)$ using the pseudo-labels from the target domain. While many methods focus on mitigating the is… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2406.17987  [pdf, other

    cs.CL cs.AI

    Multi-step Knowledge Retrieval and Inference over Unstructured Data

    Authors: Aditya Kalyanpur, Kailash Saravanakumar, Victor Barres, CJ McFate, Lori Moon, Nati Seifu, Maksim Eremeev, Jose Barrera, Eric Brown, David Ferrucci

    Abstract: The advent of Large Language Models (LLMs) and Generative AI has revolutionized natural language applications across various domains. However, high-stakes decision-making tasks in fields such as medical, legal and finance require a level of precision, comprehensiveness, and logical consistency that pure LLM or Retrieval-Augmented-Generation (RAG) approaches often fail to deliver. At Elemental Cogn… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.16860  [pdf, other

    cs.CV

    Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

    Authors: Shengbang Tong, Ellis Brown, Penghao Wu, Sanghyun Woo, Manoj Middepogu, Sai Charitha Akula, Jihan Yang, Shusheng Yang, Adithya Iyer, Xichen Pan, Austin Wang, Rob Fergus, Yann LeCun, Saining Xie

    Abstract: We introduce Cambrian-1, a family of multimodal LLMs (MLLMs) designed with a vision-centric approach. While stronger language models can enhance multimodal capabilities, the design choices for vision components are often insufficiently explored and disconnected from visual representation learning research. This gap hinders accurate sensory grounding in real-world scenarios. Our study uses LLMs and… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Website at https://cambrian-mllm.github.io

  5. arXiv:2406.00116  [pdf, other

    cs.HC cs.LG

    A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning

    Authors: Eura Nofshin, Esther Brown, Brian Lim, Weiwei Pan, Finale Doshi-Velez

    Abstract: Existing user studies suggest that different tasks may require explanations with different properties. However, user studies are expensive. In this paper, we introduce a generalizable, cost-effective method for identifying task-relevant explanation properties in silico, which can guide the design of more expensive user studies. We use our approach to identify relevant proxies for three example tas… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  6. arXiv:2405.16820  [pdf, other

    cs.LG cs.AI cs.CY cs.HC

    Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings

    Authors: Robert Wolfe, Isaac Slaughter, Bin Han, Bingbing Wen, Yiwei Yang, Lucas Rosenblatt, Bernease Herman, Eva Brown, Zening Qu, Nic Weber, Bill Howe

    Abstract: The rapid proliferation of generative AI has raised questions about the competitiveness of lower-parameter, locally tunable, open-weight models relative to high-parameter, API-guarded, closed-weight models in terms of performance, domain adaptation, cost, and generalization. Centering under-resourced yet risk-intolerant settings in government, research, and healthcare, we see for-profit closed-wei… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted at the ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2024

  7. arXiv:2404.12048  [pdf, other

    cs.LO

    Symbolic Computation for All the Fun

    Authors: Chad E. Brown, Mikoláš Janota, Mirek Olšák

    Abstract: Motivated by the recent 10 million dollar AIMO challenge, this paper targets the problem of finding all functions conforming to a given specification. This is a popular problem at mathematical competitions and it brings about a number of challenges, primarily, synthesizing the possible solutions and proving that no other solutions exist. Often, there are infinitely many solutions and then the set… ▽ More

    Submitted 23 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  8. arXiv:2404.08213  [pdf, other

    cs.HC

    GazePointAR: A Context-Aware Multimodal Voice Assistant for Pronoun Disambiguation in Wearable Augmented Reality

    Authors: Jaewook Lee, Jun Wang, Elizabeth Brown, Liam Chu, Sebastian S. Rodriguez, Jon E. Froehlich

    Abstract: Voice assistants (VAs) like Siri and Alexa are transforming human-computer interaction; however, they lack awareness of users' spatiotemporal context, resulting in limited performance and unnatural dialogue. We introduce GazePointAR, a fully-functional context-aware VA for wearable augmented reality that leverages eye gaze, pointing gestures, and conversation history to disambiguate speech queries… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  9. arXiv:2404.06672  [pdf, other

    cs.SE cs.CY

    Biomedical Open Source Software: Crucial Packages and Hidden Heroes

    Authors: Andrew Nesbitt, Boris Veytsman, Daniel Mietchen, Eva Maxfield Brown, James Howison, João Felipe Pimentel, Laurent Hèbert-Dufresne, Stephan Druskat

    Abstract: Despite the importance of scientific software for research, it is often not formally recognized and rewarded. This is especially true for foundation libraries, which are used by the software packages visible to the users, being ``hidden'' themselves. The funders and other organizations need to understand the complex network of computer programs that the modern research relies upon. In this work… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  10. arXiv:2404.01761  [pdf, other

    cs.LO math.CO

    A Formal Proof of R(4,5)=25

    Authors: Thibault Gauthier, Chad E. Brown

    Abstract: In 1995, McKay and Radziszowski proved that the Ramsey number R(4,5) is equal to 25. Their proof relies on a combination of high-level arguments and computational steps. The authors have performed the computational parts of the proof with different implementations in order to reduce the possibility of an error in their programs. In this work, we prove this theorem in the interactive theorem prover… ▽ More

    Submitted 12 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  11. arXiv:2403.19831  [pdf, other

    cs.GT

    TASR: A Novel Trust-Aware Stackelberg Routing Algorithm to Mitigate Traffic Congestion

    Authors: Doris E. M. Brown, Venkata Sriram Siddhardh Nadendla, Sajal K. Das

    Abstract: Stackelberg routing platforms (SRP) reduce congestion in one-shot traffic networks by proposing optimal route recommendations to selfish travelers. Traditionally, Stackelberg routing is cast as a partial control problem where a fraction of traveler flow complies with route recommendations, while the remaining respond as selfish travelers. In this paper, a novel Stackelberg routing framework is for… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  12. arXiv:2402.03310  [pdf, other

    cs.AI cs.CV

    V-IRL: Grounding Virtual Intelligence in Real Life

    Authors: Jihan Yang, Runyu Ding, Ellis Brown, Xiaojuan Qi, Saining Xie

    Abstract: There is a sensory gulf between the Earth that humans inhabit and the digital realms in which modern AI agents are created. To develop AI agents that can sense, think, and act as flexibly as humans in real-world settings, it is imperative to bridge the realism gap between the digital and physical worlds. How can we embody agents in an environment as rich and diverse as the one we inhabit, without… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Project page: https://virl-platform.github.io

  13. arXiv:2312.11761  [pdf, other

    cs.AI

    MineObserver 2.0: A Deep Learning & In-Game Framework for Assessing Natural Language Descriptions of Minecraft Imagery

    Authors: Jay Mahajan, Samuel Hum, Jack Henhapl, Diya Yunus, Matthew Gadbury, Emi Brown, Jeff Ginger, H. Chad Lane

    Abstract: MineObserver 2.0 is an AI framework that uses Computer Vision and Natural Language Processing for assessing the accuracy of learner-generated descriptions of Minecraft images that include some scientifically relevant content. The system automatically assesses the accuracy of participant observations, written in natural language, made during science learning activities that take place in Minecraft.… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  14. arXiv:2312.01435  [pdf, other

    cs.CV

    Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT

    Authors: Saurav Sengupta, Donald E. Brown

    Abstract: Deep learning for histopathology has been successfully used for disease classification, image segmentation and more. However, combining image and text modalities using current state-of-the-art (SOTA) methods has been a challenge due to the high resolution of histopathology images. Automatic report generation for histopathology images is one such challenge. In this work, we show that using an exist… ▽ More

    Submitted 15 March, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted at IEEE ISBI 2024. arXiv admin note: substantial text overlap with arXiv:2311.06176

  15. arXiv:2311.06176  [pdf, other

    cs.CV

    Automatic Report Generation for Histopathology images using pre-trained Vision Transformers

    Authors: Saurav Sengupta, Donald E. Brown

    Abstract: Deep learning for histopathology has been successfully used for disease classification, image segmentation and more. However, combining image and text modalities using current state-of-the-art methods has been a challenge due to the high resolution of histopathology images. Automatic report generation for histopathology images is one such challenge. In this work, we show that using an existing pre… ▽ More

    Submitted 13 November, 2023; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 09 pages

  16. arXiv:2309.10187  [pdf, other

    cs.HC

    Automated Interviewer or Augmented Survey? Collecting Social Data with Large Language Models

    Authors: Alejandro Cuevas Villalba, Eva M. Brown, Jennifer V. Scurrell, Jason Entenmann, Madeleine I. G. Daepp

    Abstract: Qualitative methods like interviews produce richer data in comparison with quantitative surveys, but are difficult to scale. Switching from web-based questionnaires to interactive chatbots offers a compromise, improving user engagement and response quality. Uptake remains limited, however, because of differences in users' expectations versus the capabilities of natural language processing methods.… ▽ More

    Submitted 10 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  17. arXiv:2309.03744  [pdf, other

    eess.IV cs.CV

    Label-efficient Contrastive Learning-based model for nuclei detection and classification in 3D Cardiovascular Immunofluorescent Images

    Authors: Nazanin Moradinasab, Rebecca A. Deaton, Laura S. Shankman, Gary K. Owens, Donald E. Brown

    Abstract: Recently, deep learning-based methods achieved promising performance in nuclei detection and classification applications. However, training deep learning-based methods requires a large amount of pixel-wise annotated data, which is time-consuming and labor-intensive, especially in 3D images. An alternative approach is to adapt weak-annotation methods, such as labeling each nucleus with a point, but… ▽ More

    Submitted 14 January, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: 11 pages, 5 figures, MICCAI Workshop Conference 2023

  18. arXiv:2304.13907  [pdf, other

    cs.SI

    Network Analysis as a Tool for Shaping Conservation and Development Policy: A Case Study of Timber Market Optimization in India

    Authors: Xiou Ge, Sarah E. Brown, Pushpendra Rana, Lav R. Varshney, Daniel C. Miller

    Abstract: The incorporation of trees on farms can help to improve livelihoods and build resilience among small-holder farmers in developing countries. On-farm trees can help gen- erate additional income from commercial tree harvest as well as contribute significant environmental benefits and ecosystem services to increase resiliency. Long-term benefits from tree-based livelihoods, however, depend on sustain… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Paper accepted to proceedings of the 5th Data for Good Exchange (D4GX)

  19. arXiv:2304.02986  [pdf, ps, other

    cs.LO

    A Mathematical Benchmark for Inductive Theorem Provers

    Authors: Thibault Gauthier, Chad E. Brown, Mikolas Janota, Josef Urban

    Abstract: We present a benchmark of 29687 problems derived from the On-Line Encyclopedia of Integer Sequences (OEIS). Each problem expresses the equivalence of two syntactically different programs generating the same OEIS sequence. Such programs were conjectured by a learning-guided synthesis system using a language with looping operators. The operators implement recursion, and thus many of the proofs requi… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  20. arXiv:2303.16203  [pdf, other

    cs.LG cs.AI cs.CV cs.NE cs.RO

    Your Diffusion Model is Secretly a Zero-Shot Classifier

    Authors: Alexander C. Li, Mihir Prabhudesai, Shivam Duggal, Ellis Brown, Deepak Pathak

    Abstract: The recent wave of large-scale text-to-image diffusion models has dramatically increased our text-based image generation abilities. These models can generate realistic images for a staggering variety of prompts and exhibit impressive compositional generalization abilities. Almost all use cases thus far have solely focused on sampling; however, diffusion models can also provide conditional density… ▽ More

    Submitted 12 September, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: In ICCV 2023. Website at https://diffusion-classifier.github.io/

  21. arXiv:2302.14177  [pdf, other

    cs.DL cs.SE

    Soft-Search: Two Datasets to Study the Identification and Production of Research Software

    Authors: Eva Maxfield Brown, Lindsey Schwartz, Richard Lewei Huang, Nicholas Weber

    Abstract: Software is an important tool for scholarly work, but software produced for research is in many cases not easily identifiable or discoverable. A potential first step in linking research and software is software identification. In this paper we present two datasets to study the identification and production of research software. The first dataset contains almost 1000 human labeled annotations of so… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  22. arXiv:2302.14051  [pdf, other

    cs.LG cs.AI cs.CV cs.NE cs.RO

    Internet Explorer: Targeted Representation Learning on the Open Web

    Authors: Alexander C. Li, Ellis Brown, Alexei A. Efros, Deepak Pathak

    Abstract: Modern vision models typically rely on fine-tuning general-purpose models pre-trained on large, static datasets. These general-purpose models only capture the knowledge within their pre-training datasets, which are tiny, out-of-date snapshots of the Internet -- where billions of images are uploaded each day. We suggest an alternate approach: rather than hoping our static datasets transfer to our d… ▽ More

    Submitted 6 September, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: In ICML 2023. Website at https://internet-explorer-ssl.github.io/

  23. Analyzing historical diagnosis code data from NIH N3C and RECOVER Programs using deep learning to determine risk factors for Long Covid

    Authors: Saurav Sengupta, Johanna Loomba, Suchetha Sharma, Donald E. Brown, Lorna Thorpe, Melissa A Haendel, Christopher G Chute, Stephanie Hong

    Abstract: Post-acute sequelae of SARS-CoV-2 infection (PASC) or Long COVID is an emerging medical condition that has been observed in several patients with a positive diagnosis for COVID-19. Historical Electronic Health Records (EHR) like diagnosis codes, lab results and clinical notes have been analyzed using deep learning and have been used to predict future clinical events. In this paper, we propose an i… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  24. arXiv:2208.05561  [pdf, other

    cs.LG cs.AI

    SSDBCODI: Semi-Supervised Density-Based Clustering with Outliers Detection Integrated

    Authors: Jiahao Deng, Eli T. Brown

    Abstract: Clustering analysis is one of the critical tasks in machine learning. Traditionally, clustering has been an independent task, separate from outlier detection. Due to the fact that the performance of clustering can be significantly eroded by outliers, a small number of algorithms try to incorporate outlier detection in the process of clustering. However, most of those algorithms are based on unsupe… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

  25. arXiv:2208.00098  [pdf, other

    cs.CV cs.AI

    Weakly Supervised Deep Instance Nuclei Detection using Points Annotation in 3D Cardiovascular Immunofluorescent Images

    Authors: Nazanin Moradinasab, Yash Sharma, Laura S. Shankman, Gary K. Owens, Donald E. Brown

    Abstract: Two major causes of death in the United States and worldwide are stroke and myocardial infarction. The underlying cause of both is thrombi released from ruptured or eroded unstable atherosclerotic plaques that occlude vessels in the heart (myocardial infarction) or the brain (stroke). Clinical studies show that plaque composition plays a more important role than lesion size in plaque rupture or er… ▽ More

    Submitted 29 July, 2022; originally announced August 2022.

  26. arXiv:2206.14437  [pdf, other

    cs.CV

    MaNi: Maximizing Mutual Information for Nuclei Cross-Domain Unsupervised Segmentation

    Authors: Yash Sharma, Sana Syed, Donald E. Brown

    Abstract: In this work, we propose a mutual information (MI) based unsupervised domain adaptation (UDA) method for the cross-domain nuclei segmentation. Nuclei vary substantially in structure and appearances across different cancer types, leading to a drop in performance of deep learning models when trained on one cancer type and tested on another. This domain shift becomes even more critical as accurate se… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Accepted at MICCAI 2022

  27. arXiv:2205.06640  [pdf, ps, other

    cs.LO

    Lash 1.0 (System Description)

    Authors: Chad E. Brown, Cezary Kaliszyk

    Abstract: Lash is a higher-order automated theorem prover created as a fork of the theorem prover Satallax. The basic underlying calculus of Satallax is a ground tableau calculus whose rules only use shallow information about the terms and formulas taking part in the rule. Lash uses new, efficient C representations of vital structures and operations. Most importantly, Lash uses a C representation of (normal… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

    Journal ref: IJCAR 2022 Conference Submission

  28. arXiv:2204.12432  [pdf, other

    cs.LG eess.SP

    Encoding Cardiopulmonary Exercise Testing Time Series as Images for Classification using Convolutional Neural Network

    Authors: Yash Sharma, Nick Coronato, Donald E. Brown

    Abstract: Exercise testing has been available for more than a half-century and is a remarkably versatile tool for diagnostic and prognostic information of patients for a range of diseases, especially cardiovascular and pulmonary. With rapid advancements in technology, wearables, and learning algorithm in the last decade, its scope has evolved. Specifically, Cardiopulmonary exercise testing (CPX) is one of t… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Accepted in NeurIPS 2021 - MLPH Workshop; EMBC 2022. Code: https://github.com/YashSharma/MultivariateTimeSeries

  29. arXiv:2204.09110  [pdf

    cs.DL

    Councils in Action: Automating the Curation of Municipal Governance Data for Research

    Authors: Eva Maxfield Brown, Nicholas Weber

    Abstract: Large scale comparative research into municipal governance is often prohibitively difficult due to a lack of high-quality data. But, recent advances in speech-to-text algorithms and natural language processing has made it possible to more easily collect and analyze data about municipal governments. In this paper, we introduce an open-source platform, the Council Data Project (CDP), to curate novel… ▽ More

    Submitted 31 August, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Keywords: public interest technology; municipal governance; data curation; computational data access; natural language processing To Be Published with 2022 ASIS&T Annual Meeting (https://www.asist.org/am22/)

  30. arXiv:2112.09051  [pdf

    astro-ph.SR cs.LG

    Simultaneous Multivariate Forecast of Space Weather Indices using Deep Neural Network Ensembles

    Authors: Bernard Benson, Edward Brown, Stefano Bonasera, Giacomo Acciarini, Jorge A. Pérez-Hernández, Eric Sutton, Moriba K. Jah, Christopher Bridges, Meng Jin, Atılım Güneş Baydin

    Abstract: Solar radio flux along with geomagnetic indices are important indicators of solar activity and its effects. Extreme solar events such as flares and geomagnetic storms can negatively affect the space environment including satellites in low-Earth orbit. Therefore, forecasting these space weather indices is of great importance in space operations and science. In this study, we propose a model based o… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: Fourth Workshop on Machine Learning and the Physical Sciences (NeurIPS 2021)

  31. arXiv:2112.01537  [pdf, other

    cs.HC cs.AI cs.LG

    Improving mathematical questioning in teacher training

    Authors: Debajyoti Datta, Maria Phillips, James P Bywater, Jennifer Chiu, Ginger S. Watson, Laura E. Barnes, Donald E Brown

    Abstract: High-fidelity, AI-based simulated classroom systems enable teachers to rehearse effective teaching strategies. However, dialogue-oriented open-ended conversations such as teaching a student about scale factors can be difficult to model. This paper builds a text-based interactive conversational agent to help teachers practice mathematical questioning skills based on the well-known Instructional Qua… ▽ More

    Submitted 6 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted to appear at the NeurIPS 2021 Human Centered AI Workshop (HCAI). Data collection process for this data is described here arXiv:2112.00985

  32. arXiv:2112.00985  [pdf, other

    cs.AI cs.HC cs.LG

    Evaluation of mathematical questioning strategies using data collected through weak supervision

    Authors: Debajyoti Datta, Maria Phillips, James P Bywater, Jennifer Chiu, Ginger S. Watson, Laura E. Barnes, Donald E Brown

    Abstract: A large body of research demonstrates how teachers' questioning strategies can improve student learning outcomes. However, developing new scenarios is challenging because of the lack of training data for a specific scenario and the costs associated with labeling. This paper presents a high-fidelity, AI-based classroom simulator to help teachers rehearse research-based mathematical questioning skil… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted to appear at the NeurIPS 2021 Workshop on Math AI for Education (MATHAI4ED)

  33. arXiv:2106.11077  [pdf, other

    cs.CY cs.CL

    Toward a Knowledge Discovery Framework for Data Science Job Market in the United States

    Authors: Mojtaba Heidarysafa, Kamran Kowsari, Masoud Bashiri, Donald E. Brown

    Abstract: The growth of the data science field requires better tools to understand such a fast-paced growing domain. Moreover, individuals from different backgrounds became interested in following a career as data scientists. Therefore, providing a quantitative guide for individuals and organizations to understand the skills required in the job market would be crucial. This paper introduces a framework to a… ▽ More

    Submitted 20 July, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

  34. arXiv:2106.07068  [pdf, other

    cs.CV

    HistoTransfer: Understanding Transfer Learning for Histopathology

    Authors: Yash Sharma, Lubaina Ehsan, Sana Syed, Donald E. Brown

    Abstract: Advancement in digital pathology and artificial intelligence has enabled deep learning-based computer vision techniques for automated disease diagnosis and prognosis. However, WSIs present unique computational and algorithmic challenges. WSIs are gigapixel-sized, making them infeasible to be used directly for training deep neural networks. Hence, for modeling, a two-stage approach is adopted: Patc… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.

    Comments: Accepted at IEEE International Conference on Biomedical and Health Informatics (BHI'21). arXiv admin note: text overlap with arXiv:2103.10626

  35. arXiv:2103.10626  [pdf, other

    eess.IV cs.CV cs.LG

    Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification

    Authors: Yash Sharma, Aman Shrivastava, Lubaina Ehsan, Christopher A. Moskaluk, Sana Syed, Donald E. Brown

    Abstract: In recent years, the availability of digitized Whole Slide Images (WSIs) has enabled the use of deep learning-based computer vision techniques for automated disease diagnosis. However, WSIs present unique computational and algorithmic challenges. WSIs are gigapixel-sized ($\sim$100K pixels), making them infeasible to be used directly for training deep neural networks. Also, often only slide-level… ▽ More

    Submitted 13 June, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Comments: Accepted at MIDL, 2021 - https://openreview.net/forum?id=7i1-2oKIELU

  36. arXiv:2101.05326  [pdf, other

    eess.IV cs.CV

    Advancing Eosinophilic Esophagitis Diagnosis and Phenotype Assessment with Deep Learning Computer Vision

    Authors: William Adorno III, Alexis Catalano, Lubaina Ehsan, Hans Vitzhum von Eckstaedt, Barrett Barnes, Emily McGowan, Sana Syed, Donald E. Brown

    Abstract: Eosinophilic Esophagitis (EoE) is an inflammatory esophageal disease which is increasing in prevalence. The diagnostic gold-standard involves manual review of a patient's biopsy tissue sample by a clinical pathologist for the presence of 15 or greater eosinophils within a single high-power field (400x magnification). Diagnosing EoE can be a cumbersome process with added difficulty for assessing th… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: This paper contains 12 pages, 9 figures, and 7 tables

  37. arXiv:2008.12333  [pdf, other

    cs.LG cs.AI stat.ML

    Controlling Level of Unconsciousness by Titrating Propofol with Deep Reinforcement Learning

    Authors: Gabe Schamberg, Marcus Badgeley, Emery N. Brown

    Abstract: Reinforcement Learning (RL) can be used to fit a mapping from patient state to a medication regimen. Prior studies have used deterministic and value-based tabular learning to learn a propofol dose from an observed anesthetic state. Deep RL replaces the table with a deep neural network and has been used to learn medication regimens from registry databases. Here we perform the first application of d… ▽ More

    Submitted 9 September, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: International Conference on Artificial Intelligence in Medicine 2020

  38. arXiv:2006.15747  [pdf, ps, other

    cs.GT cs.DS

    Random Assignment Under Bi-Valued Utilities: Analyzing Hylland-Zeckhauser, Nash-Bargaining, and other Rules

    Authors: Haris Aziz, Ethan Brown

    Abstract: The Hylland-Zeckhauser (HZ) rule is a well-known rule for random assignment of items. The complexity of the rule has received renewed interest recently with Vazirani and Yannakakis (2020) proposing a strongly polynomial-time algorithm for the rule under bi-valued utilities, and making several general insights. We study the rule under the case of agents having bi-valued utilities. We point out seve… ▽ More

    Submitted 2 March, 2021; v1 submitted 28 June, 2020; originally announced June 2020.

    MSC Class: 91A12; 68Q15 ACM Class: F.2; J.4

  39. arXiv:2005.05516  [pdf, other

    cs.GT cs.IR cs.MA eess.SY

    Framing Effects on Strategic Information Design under Receiver Distrust and Unknown State

    Authors: Doris E. M. Brown, Venkata Sriram Siddhardh Nadendla

    Abstract: Strategic information design is a framework where a sender designs information strategically to steer its receiver's decision towards a desired choice. Traditionally, such frameworks have always assumed that the sender and the receiver comprehends the state of the choice environment, and that the receiver always trusts the sender's signal. This paper deviates from these assumptions and re-investig… ▽ More

    Submitted 21 July, 2021; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: 12 pages, 5 figures; This is a working draft, and can potentially have errors. Any feedback will be greatly appreciated, and will be acknowledged in the subsequent versions

  40. arXiv:2005.03868  [pdf, other

    eess.IV cs.LG stat.ML

    Hierarchical Deep Convolutional Neural Networks for Multi-category Diagnosis of Gastrointestinal Disorders on Histopathological Images

    Authors: Rasoul Sali, Sodiq Adewole, Lubaina Ehsan, Lee A. Denson, Paul Kelly, Beatrice C. Amadi, Lori Holtz, Syed Asad Ali, Sean R. Moore, Sana Syed, Donald E. Brown

    Abstract: Deep convolutional neural networks(CNNs) have been successful for a wide range of computer vision tasks, including image classification. A specific area of the application lies in digital pathology for pattern recognition in the tissue-based diagnosis of gastrointestinal(GI) diseases. This domain can utilize CNNs to translate histopathological images into precise diagnostics. This is challenging s… ▽ More

    Submitted 6 August, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: accepted at IEEE International Conference on Healthcare Informatics (ICHI 2020)

  41. arXiv:2004.06997  [pdf, ps, other

    cs.LO cs.LG

    Prolog Technology Reinforcement Learning Prover

    Authors: Zsolt Zombori, Josef Urban, Chad E. Brown

    Abstract: We present a reinforcement learning toolkit for experiments with guiding automated theorem proving in the connection calculus. The core of the toolkit is a compact and easy to extend Prolog-based automated theorem prover called plCoP. plCoP builds on the leanCoP Prolog implementation and adds learning-guided Monte-Carlo Tree Search as done in the rlCoP system. Other components include a Python int… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

  42. arXiv:2004.06518  [pdf, other

    cs.SI cs.AI cs.CL cs.LG stat.ML

    Gender Detection on Social Networks using Ensemble Deep Learning

    Authors: Kamran Kowsari, Mojtaba Heidarysafa, Tolu Odukoya, Philip Potter, Laura E. Barnes, Donald E. Brown

    Abstract: Analyzing the ever-increasing volume of posts on social media sites such as Facebook and Twitter requires improved information processing methods for profiling authorship. Document classification is central to this task, but the performance of traditional supervised classifiers has degraded as the volume of social media has increased. This paper addresses this problem in the context of gender dete… ▽ More

    Submitted 9 September, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

  43. arXiv:2002.00223  [pdf, other

    cs.CY cs.AI

    Dialogue-Based Simulation For Cultural Awareness Training

    Authors: Sodiq Adewole, Erfaneh Gharavi, Benjamin Shpringer, Martin Bolger, Vaibhav Sharma, Sung Ming Yang, Donald E. Brown

    Abstract: Existing simulations designed for cultural and interpersonal skill training rely on pre-defined responses with a menu option selection interface. Using a multiple-choice interface and restricting trainees' responses may limit the trainees' ability to apply the lessons in real life situations. This systems also uses a simplistic evaluation model, where trainees' selected options are marked as eithe… ▽ More

    Submitted 18 October, 2021; v1 submitted 1 February, 2020; originally announced February 2020.

  44. arXiv:1912.03804  [pdf, other

    cs.CL cs.IR

    Women in ISIS Propaganda: A Natural Language Processing Analysis of Topics and Emotions in a Comparison with Mainstream Religious Group

    Authors: Mojtaba Heidarysafa, Kamran Kowsari, Tolu Odukoya, Philip Potter, Laura E. Barnes, Donald E. Brown

    Abstract: Online propaganda is central to the recruitment strategies of extremist groups and in recent years these efforts have increasingly extended to women. To investigate ISIS' approach to targeting women in their online propaganda and uncover implications for counterterrorism, we rely on text mining and natural language processing (NLP). Specifically, we extract articles published in Dabiq and Rumiyah… ▽ More

    Submitted 8 December, 2019; originally announced December 2019.

  45. arXiv:1912.01525  [pdf, ps, other

    cs.AI cs.LO

    Self-Learned Formula Synthesis in Set Theory

    Authors: Chad E. Brown, Thibault Gauthier

    Abstract: A reinforcement learning algorithm accomplishes the task of synthesizing a set-theoretical formula that evaluates to given truth values for given assignments.

    Submitted 3 December, 2019; originally announced December 2019.

  46. arXiv:1911.04873  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Can Neural Networks Learn Symbolic Rewriting?

    Authors: Bartosz Piotrowski, Josef Urban, Chad E. Brown, Cezary Kaliszyk

    Abstract: This work investigates if the current neural architectures are adequate for learning symbolic rewriting. Two kinds of data sets are proposed for this research -- one based on automated proofs and the other being a synthetic set of polynomial terms. The experiments with use of the current neural machine translation models are performed and its results are discussed. Ideas for extending this line of… ▽ More

    Submitted 26 May, 2020; v1 submitted 7 November, 2019; originally announced November 2019.

  47. arXiv:1910.08501  [pdf

    stat.ML cs.LG eess.SP stat.AP

    Classification of spherical objects based on the form function of acoustic echoes

    Authors: Mariia Dmitrieva, Keith E. Brown, Gary J. Heald, David M. Lane

    Abstract: One way to recognise an object is to study how the echo has been shaped during the interaction with the target. Wideband sonar allows the study of the energy distribution for a large range of frequencies. The frequency distribution contains information about an object, including its inner structure. This information is a key for automatic recognition. The scattering by a target can be quantitative… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

    Journal ref: Proc. of the 4th Underwater Acoustics Conference and Exhibition (UACE), Skiathos, Greece, Sept 2017

  48. arXiv:1910.03084  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM stat.ML

    CeliacNet: Celiac Disease Severity Diagnosis on Duodenal Histopathological Images Using Deep Residual Networks

    Authors: Rasoul Sali, Lubaina Ehsan, Kamran Kowsari, Marium Khan, Christopher A. Moskaluk, Sana Syed, Donald E. Brown

    Abstract: Celiac Disease (CD) is a chronic autoimmune disease that affects the small intestine in genetically predisposed children and adults. Gluten exposure triggers an inflammatory cascade which leads to compromised intestinal barrier function. If this enteropathy is unrecognized, this can lead to anemia, decreased bone density, and, in longstanding cases, intestinal cancer. The prevalence of the disorde… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: accepted at IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM 2019)

  49. arXiv:1909.01963  [pdf, other

    eess.IV cs.CV q-bio.QM

    Self-Attentive Adversarial Stain Normalization

    Authors: Aman Shrivastava, Will Adorno, Yash Sharma, Lubaina Ehsan, S. Asad Ali, Sean R. Moore, Beatrice C. Amadi, Paul Kelly, Sana Syed, Donald E. Brown

    Abstract: Hematoxylin and Eosin (H&E) stained Whole Slide Images (WSIs) are utilized for biopsy visualization-based diagnostic and prognostic assessment of diseases. Variation in the H&E staining process across different lab sites can lead to significant variations in biopsy image appearance. These variations introduce an undesirable bias when the slides are examined by pathologists or used for training dee… ▽ More

    Submitted 22 November, 2020; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted at AIDP (ICPR 2021)

  50. arXiv:1908.03272  [pdf, other

    q-bio.QM cs.CV eess.IV

    Deep Learning for Visual Recognition of Environmental Enteropathy and Celiac Disease

    Authors: Aman Shrivastava, Karan Kant, Saurav Sengupta, Sung-Jun Kang, Marium Khan, Asad Ali, Sean R. Moore, Beatrice C. Amadi, Paul Kelly, Donald E. Brown, Sana Syed

    Abstract: Physicians use biopsies to distinguish between different but histologically similar enteropathies. The range of syndromes and pathologies that could cause different gastrointestinal conditions makes this a difficult problem. Recently, deep learning has been used successfully in helping diagnose cancerous tissues in histopathological images. These successes motivated the research presented in this… ▽ More

    Submitted 8 August, 2019; originally announced August 2019.