Skip to main content

Showing 1–50 of 68 results for author: Roberts, M

  1. arXiv:2406.19314  [pdf, other

    cs.CL cs.AI cs.LG

    LiveBench: A Challenging, Contamination-Free LLM Benchmark

    Authors: Colin White, Samuel Dooley, Manley Roberts, Arka Pal, Ben Feuer, Siddhartha Jain, Ravid Shwartz-Ziv, Neel Jain, Khalid Saifullah, Siddartha Naidu, Chinmay Hegde, Yann LeCun, Tom Goldstein, Willie Neiswanger, Micah Goldblum

    Abstract: Test set contamination, wherein test data from a benchmark ends up in a newer model's training set, is a well-documented obstacle for fair LLM evaluation and can quickly render benchmarks obsolete. To mitigate this, many recent benchmarks crowdsource new prompts and evaluations from human or LLM judges; however, these can introduce significant biases, and break down when scoring hard questions. In… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.10086  [pdf

    cs.CL cs.LG stat.ME

    Discovering influential text using convolutional neural networks

    Authors: Megan Ayers, Luke Sanford, Margaret Roberts, Eddie Yang

    Abstract: Experimental methods for estimating the impacts of text on human evaluation have been widely used in the social sciences. However, researchers in experimental settings are usually limited to testing a small number of pre-specified text treatments. While efforts to mine unstructured texts for features that causally affect outcomes have been ongoing in recent years, these models have primarily focus… ▽ More

    Submitted 21 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: To be published in ACL 2024 Findings

  3. arXiv:2406.08391  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Large Language Models Must Be Taught to Know What They Don't Know

    Authors: Sanyam Kapoor, Nate Gruver, Manley Roberts, Katherine Collins, Arka Pal, Umang Bhatt, Adrian Weller, Samuel Dooley, Micah Goldblum, Andrew Gordon Wilson

    Abstract: When using large language models (LLMs) in high-stakes applications, we need to know when we can trust their predictions. Some works argue that prompting high-performance LLMs is sufficient to produce calibrated uncertainties, while others introduce sampling methods that can be prohibitively expensive. In this work, we first argue that prompting on its own is insufficient to achieve good calibrati… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Code available at: https://github.com/activatedgeek/calibration-tuning

  4. arXiv:2405.19224  [pdf, other

    eess.IV cs.CV

    A study on the adequacy of common IQA measures for medical images

    Authors: Anna Breger, Clemens Karner, Ian Selby, Janek Gröhl, Sören Dittmer, Edward Lilley, Judith Babar, Jake Beckford, Timothy J Sadler, Shahab Shahipasand, Arthikkaa Thavakumar, Michael Roberts, Carola-Bibiane Schönlieb

    Abstract: Image quality assessment (IQA) is standard practice in the development stage of novel machine learning algorithms that operate on images. The most commonly used IQA measures have been developed and tested for natural images, but not in the medical setting. Reported inconsistencies arising in medical images are not surprising, as they have different properties than natural images. In this study, we… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.19097  [pdf, other

    eess.IV cs.CV

    A study of why we need to reassess full reference image quality assessment with medical images

    Authors: Anna Breger, Ander Biguri, Malena Sabaté Landman, Ian Selby, Nicole Amberg, Elisabeth Brunner, Janek Gröhl, Sepideh Hatamikia, Clemens Karner, Lipeng Ning, Sören Dittmer, Michael Roberts, AIX-COVNET Collaboration, Carola-Bibiane Schönlieb

    Abstract: Image quality assessment (IQA) is not just indispensable in clinical practice to ensure high standards, but also in the development stage of novel algorithms that operate on medical images with reference data. This paper provides a structured and comprehensive collection of examples where the two most common full reference (FR) image quality measures prove to be unsuitable for the assessment of no… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.19000  [pdf, other

    cs.LG

    FedMAP: Unlocking Potential in Personalized Federated Learning through Bi-Level MAP Optimization

    Authors: Fan Zhang, Carlos Esteve-Yagüe, Sören Dittmer, Carola-Bibiane Schönlieb, Michael Roberts

    Abstract: Federated Learning (FL) enables collaborative training of machine learning models on decentralized data while preserving data privacy. However, data across clients often differs significantly due to class imbalance, feature distribution skew, sample size imbalance, and other phenomena. Leveraging information from these not identically distributed (non-IID) datasets poses substantial challenges. FL… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  7. arXiv:2405.09597  [pdf

    cs.LG cs.AI

    When AI Eats Itself: On the Caveats of Data Pollution in the Era of Generative AI

    Authors: Xiaodan Xing, Fadong Shi, Jiahao Huang, Yinzhe Wu, Yang Nan, Sheng Zhang, Yingying Fang, Mike Roberts, Carola-Bibiane Schönlieb, Javier Del Ser, Guang Yang

    Abstract: Generative artificial intelligence (AI) technologies and large models are producing realistic outputs across various domains, such as images, text, speech, and music. Creating these advanced generative models requires significant resources, particularly large and high-quality datasets. To minimize training expenses, many algorithm developers use data created by the models themselves as a cost-effe… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  8. arXiv:2404.06325  [pdf, other

    cs.AI

    Automatically Learning HTN Methods from Landmarks

    Authors: Ruoxi Li, Dana Nau, Mark Roberts, Morgan Fine-Morris

    Abstract: Hierarchical Task Network (HTN) planning usually requires a domain engineer to provide manual input about how to decompose a planning problem. Even HTN-MAKER, a well-known method-learning algorithm, requires a domain engineer to annotate the tasks with information about what to learn. We introduce CURRICULAMA, an HTN method learning algorithm that completely automates the learning process. It uses… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to FLAIRS-24

  9. arXiv:2403.15755  [pdf, other

    stat.ME cs.MA cs.SI stat.AP

    Optimized Model Selection for Estimating Treatment Effects from Costly Simulations of the US Opioid Epidemic

    Authors: Abdulrahman A. Ahmed, M. Amin Rahimian, Mark S. Roberts

    Abstract: Agent-based simulation with a synthetic population can help us compare different treatment conditions while keeping everything else constant within the same population (i.e., as digital twins). Such population-scale simulations require large computational power (i.e., CPU resources) to get accurate estimates for treatment effects. We can use meta models of the simulation results to circumvent the… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: To be presented in 2024 Annual Simulation Conference (ANNSIM'24)

  10. Goal-Oriented End-User Programming of Robots

    Authors: David Porfirio, Mark Roberts, Laura M. Hiatt

    Abstract: End-user programming (EUP) tools must balance user control with the robot's ability to plan and act autonomously. Many existing task-oriented EUP tools enforce a specific level of control, e.g., by requiring that users hand-craft detailed sequences of actions, rather than offering users the flexibility to choose the level of task detail they wish to express. We thereby created a novel EUP system,… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Published in the proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction

  11. arXiv:2402.17836  [pdf, other

    cs.RO

    Considerations for End-User Development in the Caregiving Domain

    Authors: Laura Stegner, David Porfirio, Mark Roberts, Laura M. Hiatt

    Abstract: As service robots become more capable of autonomous behaviors, it becomes increasingly important to consider how people communicate with a robot what task it should perform and how to do the task. Accordingly, there has been a rise in attention to end-user development (EUD) interfaces, which enable non-roboticist end users to specify tasks for autonomous robots to perform. However, state-of-the-ar… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Presented at AAAI Fall Symposium Series 2023 UR-RAD

  12. arXiv:2402.13228  [pdf, other

    cs.CL cs.AI cs.LG

    Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive

    Authors: Arka Pal, Deep Karkhanis, Samuel Dooley, Manley Roberts, Siddartha Naidu, Colin White

    Abstract: Direct Preference Optimisation (DPO) is effective at significantly improving the performance of large language models (LLMs) on downstream tasks such as reasoning, summarisation, and alignment. Using pairs of preferred and dispreferred data, DPO models the relative probability of picking one response over another. In this work, first we show theoretically that the standard DPO loss can lead to a r… ▽ More

    Submitted 3 July, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  13. arXiv:2402.10224  [pdf, other

    cs.RO cs.AI cs.MA

    Human-Centric Goal Reasoning with Ripple-Down Rules

    Authors: Kenji Brameld, Germán Castro, Claude Sammut, Mark Roberts, David W. Aha

    Abstract: ActorSim is a goal reasoning framework developed at the Naval Research Laboratory. Originally, all goal reasoning rules were hand-crafted. This work extends ActorSim with the capability of learning by demonstration, that is, when a human trainer disagrees with a decision made by the system, the trainer can take over and show the system the correct decision. The learning component uses Ripple-Down… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

    Comments: Proceedings of the Ninth Goal Reasoning Workshop (Advances in Cognitive Systems, 2021)

  14. arXiv:2312.16188  [pdf, other

    cs.LG stat.ME

    The curious case of the test set AUROC

    Authors: Michael Roberts, Alon Hazan, Sören Dittmer, James H. F. Rudd, Carola-Bibiane Schönlieb

    Abstract: Whilst the size and complexity of ML models have rapidly and significantly increased over the past decade, the methods for assessing their performance have not kept pace. In particular, among the many potential performance metrics, the ML community stubbornly continues to use (a) the area under the receiver operating characteristic curve (AUROC) for a validation and test cohort (distinct from trai… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 3 pages, 4 figures

  15. arXiv:2312.12482  [pdf, other

    q-bio.QM cs.LG

    New Horizons: Pioneering Pharmaceutical R&D with Generative AI from lab to the clinic -- an industry perspective

    Authors: Guy Doron, Sam Genway, Mark Roberts, Sai Jasti

    Abstract: The rapid advance of generative AI is reshaping the strategic vision for R&D across industries. The unique challenges of pharmaceutical R&D will see applications of generative AI deliver value along the entire value chain from early discovery to regulatory approval. This perspective reviews these challenges and takes a three-horizon approach to explore the generative AI applications already delive… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 21 pages, 4 figures

    MSC Class: 92C50 ACM Class: I.2.0; J.3

  16. arXiv:2310.10628  [pdf, other

    cs.CL

    Data Contamination Through the Lens of Time

    Authors: Manley Roberts, Himanshu Thakur, Christine Herlihy, Colin White, Samuel Dooley

    Abstract: Recent claims about the impressive abilities of large language models (LLMs) are often supported by evaluating publicly available benchmarks. Since LLMs train on wide swaths of the internet, this practice raises concerns of data contamination, i.e., evaluating on examples that are explicitly or implicitly included in the training data. Data contamination remains notoriously challenging to measure… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  17. arXiv:2310.02874  [pdf, other

    cs.LG cs.AI

    Recent Methodological Advances in Federated Learning for Healthcare

    Authors: Fan Zhang, Daniel Kreuter, Yichen Chen, Sören Dittmer, Samuel Tull, Tolou Shadbahr, BloodCounts! Collaboration, Jacobus Preller, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb, Nicholas Gleadall, Michael Roberts

    Abstract: For healthcare datasets, it is often not possible to combine data samples from multiple sites due to ethical, privacy or logistical concerns. Federated learning allows for the utilisation of powerful machine learning algorithms without requiring the pooling of data. Healthcare data has many simultaneous challenges which require new methodologies to address, such as highly-siloed data, class imbala… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Supplementary table of extracted data at the end of the document

  18. arXiv:2308.13040  [pdf, other

    cs.MA cs.SI stat.AP

    Estimating Treatment Effects Using Costly Simulation Samples from a Population-Scale Model of Opioid Use Disorder

    Authors: Abdulrahman A. Ahmed, M. Amin Rahimian, Mark S. Roberts

    Abstract: Large-scale models require substantial computational resources for analysis and studying treatment conditions. Specifically, estimating treatment effects using simulations may require a lot of infeasible resources to allocate at every treatment condition. Therefore, it is essential to develop efficient methods to allocate computational resources for estimating treatment effects. Agent-based simula… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: To be presented in IEEE International Conference on Biomedical and Health Informatics 2023, repository link: https://github.com/abdulrahmanfci/intervention-estimation

  19. arXiv:2308.10882  [pdf, other

    cs.AI cs.CL

    Giraffe: Adventures in Expanding Context Lengths in LLMs

    Authors: Arka Pal, Deep Karkhanis, Manley Roberts, Samuel Dooley, Arvind Sundararajan, Siddartha Naidu

    Abstract: Modern large language models (LLMs) that rely on attention mechanisms are typically trained with fixed context lengths which enforce upper limits on the length of input sequences that they can handle at evaluation time. To use these models on sequences longer than the train-time context length, one might employ techniques from the growing family of context length extrapolation methods -- most of w… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  20. arXiv:2308.07832  [pdf, ps, other

    cs.LG cs.AI stat.ME

    REFORMS: Reporting Standards for Machine Learning Based Science

    Authors: Sayash Kapoor, Emily Cantrell, Kenny Peng, Thanh Hien Pham, Christopher A. Bail, Odd Erik Gundersen, Jake M. Hofman, Jessica Hullman, Michael A. Lones, Momin M. Malik, Priyanka Nanayakkara, Russell A. Poldrack, Inioluwa Deborah Raji, Michael Roberts, Matthew J. Salganik, Marta Serra-Garcia, Brandon M. Stewart, Gilles Vandewiele, Arvind Narayanan

    Abstract: Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways acros… ▽ More

    Submitted 19 September, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

  21. arXiv:2307.13579  [pdf, other

    cs.LG cs.AI math.ST

    Reinterpreting survival analysis in the universal approximator age

    Authors: Sören Dittmer, Michael Roberts, Jacobus Preller, AIX COVNET, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb

    Abstract: Survival analysis is an integral part of the statistical toolbox. However, while most domains of classical statistics have embraced deep learning, survival analysis only recently gained some minor attention from the deep learning community. This recent development is likely in part motivated by the COVID-19 pandemic. We aim to provide the tools needed to fully harness the potential of survival ana… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  22. arXiv:2307.12186  [pdf, other

    cs.MA cs.SI stat.AP

    Inferring epidemic dynamics using Gaussian process emulation of agent-based simulations

    Authors: Abdulrahman A. Ahmed, M. Amin Rahimian, Mark S. Roberts

    Abstract: Computational models help decision makers understand epidemic dynamics to optimize public health interventions. Agent-based simulation of disease spread in synthetic populations allows us to compare and contrast different effects across identical populations or to investigate the effect of interventions keeping every other factor constant between ``digital twins''. FRED (A Framework for Reconstruc… ▽ More

    Submitted 11 September, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: To be presented in Winter Simulation Conference 2023, repository link: https://github.com/abdulrahmanfci/gpr-abm

  23. arXiv:2306.09177  [pdf, other

    cs.LG

    Dis-AE: Multi-domain & Multi-task Generalisation on Real-World Clinical Data

    Authors: Daniel Kreuter, Samuel Tull, Julian Gilbey, Jacobus Preller, BloodCounts! Consortium, John A. D. Aston, James H. F. Rudd, Suthesh Sivapalaratnam, Carola-Bibiane Schönlieb, Nicholas Gleadall, Michael Roberts

    Abstract: Clinical data is often affected by clinically irrelevant factors such as discrepancies between measurement devices or differing processing methods between sites. In the field of machine learning (ML), these factors are known as domains and the distribution differences they cause in the data are known as domain shifts. ML models trained using data from one domain often perform poorly when applied t… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 17 pages main body, 5 figures, 18 pages of appendix

  24. arXiv:2305.09035  [pdf, other

    cs.LG

    Algorithmic Censoring in Dynamic Learning Systems

    Authors: Jennifer Chien, Margaret Roberts, Berk Ustun

    Abstract: Dynamic learning systems subject to selective labeling exhibit censoring, i.e. persistent negative predictions assigned to one or more subgroups of points. In applications like consumer finance, this results in groups of applicants that are persistently denied and thus never enter into the training data. In this work, we formalize censoring, demonstrate how it can arise, and highlight difficulties… ▽ More

    Submitted 29 June, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 28 pages, 9 figures

  25. arXiv:2211.09970  [pdf, other

    cs.LG cs.AI cs.CE physics.data-an

    Estimating defection in subscription-type markets: empirical analysis from the scholarly publishing industry

    Authors: Michael Roberts, J. Ignacio Deza, Hisham Ihshaish, Yanhui Zhu

    Abstract: We present the first empirical study on customer churn prediction in the scholarly publishing industry. The study examines our proposed method for prediction on a customer subscription data over a period of 6.5 years, which was provided by a major academic publisher. We explore the subscription-type market within the context of customer defection and modelling, and provide analysis of the business… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  26. Navigating the challenges in creating complex data systems: a development philosophy

    Authors: Sören Dittmer, Michael Roberts, Julian Gilbey, Ander Biguri, AIX-COVNET Collaboration, Jacobus Preller, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb

    Abstract: In this perspective, we argue that despite the democratization of powerful tools for data science and machine learning over the last decade, developing the code for a trustworthy and effective data science system (DSS) is getting harder. Perverse incentives and a lack of widespread software engineering (SE) skills are among many root causes we identify that naturally give rise to the current syste… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

  27. arXiv:2210.09465  [pdf, other

    cs.CV cs.LG

    Understanding CNN Fragility When Learning With Imbalanced Data

    Authors: Damien Dablain, Kristen N. Jacobson, Colin Bellinger, Mark Roberts, Nitesh Chawla

    Abstract: Convolutional neural networks (CNNs) have achieved impressive results on imbalanced image data, but they still have difficulty generalizing to minority classes and their decisions are difficult to interpret. These problems are related because the method by which CNNs generalize to minority classes, which requires improvement, is wrapped in a blackbox. To demystify CNN decisions on imbalanced data,… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  28. arXiv:2210.06849  [pdf, other

    cs.CV

    Retrospectives on the Embodied AI Workshop

    Authors: Matt Deitke, Dhruv Batra, Yonatan Bisk, Tommaso Campari, Angel X. Chang, Devendra Singh Chaplot, Changan Chen, Claudia Pérez D'Arpino, Kiana Ehsani, Ali Farhadi, Li Fei-Fei, Anthony Francis, Chuang Gan, Kristen Grauman, David Hall, Winson Han, Unnat Jain, Aniruddha Kembhavi, Jacob Krantz, Stefan Lee, Chengshu Li, Sagnik Majumder, Oleksandr Maksymets, Roberto Martín-Martín, Roozbeh Mottaghi , et al. (14 additional authors not shown)

    Abstract: We present a retrospective on the state of Embodied AI research. Our analysis focuses on 13 challenges presented at the Embodied AI Workshop at CVPR. These challenges are grouped into three themes: (1) visual navigation, (2) rearrangement, and (3) embodied vision-and-language. We discuss the dominant datasets within each theme, evaluation metrics for the challenges, and the performance of state-of… ▽ More

    Submitted 4 December, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

  29. arXiv:2207.13179  [pdf, other

    cs.LG stat.ML

    Unsupervised Learning under Latent Label Shift

    Authors: Manley Roberts, Pranav Mani, Saurabh Garg, Zachary C. Lipton

    Abstract: What sorts of structure might enable a learner to discover classes from unlabeled data? Traditional approaches rely on feature-space similarity and heroic assumptions on the data. In this paper, we introduce unsupervised learning under Latent Label Shift (LLS), where we have access to unlabeled data from multiple domains such that the label marginals $p_d(y)$ can shift across domains but the class… ▽ More

    Submitted 1 December, 2022; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2022. Manley Roberts and Pranav Mani contributed equally to this work

  30. Classification of datasets with imputed missing values: does imputation quality matter?

    Authors: Tolou Shadbahr, Michael Roberts, Jan Stanczuk, Julian Gilbey, Philip Teare, Sören Dittmer, Matthew Thorpe, Ramon Vinas Torne, Evis Sala, Pietro Lio, Mishal Patel, AIX-COVNET Collaboration, James H. F. Rudd, Tuomas Mirtti, Antti Rannikko, John A. D. Aston, Jing Tang, Carola-Bibiane Schönlieb

    Abstract: Classifying samples in incomplete datasets is a common aim for machine learning practitioners, but is non-trivial. Missing data is found in most real-world datasets and these missing values are typically imputed using established methods, followed by classification of the now complete, imputed, samples. The focus of the machine learning researcher is then to optimise the downstream classification… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 17 pages, 10 figures, 30 supplementary pages

  31. arXiv:2201.06505  [pdf

    cs.AI cs.CV

    Data Harmonisation for Information Fusion in Digital Healthcare: A State-of-the-Art Systematic Review, Meta-Analysis and Future Research Directions

    Authors: Yang Nan, Javier Del Ser, Simon Walsh, Carola Schönlieb, Michael Roberts, Ian Selby, Kit Howard, John Owen, Jon Neville, Julien Guiot, Benoit Ernst, Ana Pastor, Angel Alberich-Bayarri, Marion I. Menzel, Sean Walsh, Wim Vos, Nina Flerin, Jean-Paul Charbonnier, Eva van Rikxoort, Avishek Chatterjee, Henry Woodruff, Philippe Lambin, Leonor Cerdá-Alberich, Luis Martí-Bonmatí, Francisco Herrera , et al. (1 additional authors not shown)

    Abstract: Removing the bias and variance of multicentre data has always been a challenge in large scale digital healthcare studies, which requires the ability to integrate clinical features extracted from data acquired by different scanners and protocols to improve stability and robustness. Previous studies have described various computational approaches to fuse single modality multicentre datasets. However… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: 54 pages, 14 figures, accepted by the Information Fusion journal

  32. Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence

    Authors: Xiang Bai, Hanchen Wang, Liya Ma, Yongchao Xu, Jiefeng Gan, Ziwei Fan, Fan Yang, Ke Ma, Jiehua Yang, Song Bai, Chang Shu, Xinyu Zou, Renhao Huang, Changzheng Zhang, Xiaowu Liu, Dandan Tu, Chuou Xu, Wenqing Zhang, Xi Wang, Anguo Chen, Yu Zeng, Dehua Yang, Ming-Wei Wang, Nagaraj Holalkere, Neil J. Halin , et al. (21 additional authors not shown)

    Abstract: Artificial intelligence (AI) provides a promising substitution for streamlining COVID-19 diagnoses. However, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a considerable challenge for training a well-generalised model in clinical practices. To address this, we launch the Unified CT-COVID AI Diagnostic Initiative (UCADI),… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Nature Machine Intelligence

  33. arXiv:2110.00239  [pdf, other

    math.CT cs.LO math.LO

    Substructural fixed-point theorems and the diagonal argument: theme and variations

    Authors: David Michael Roberts

    Abstract: This article re-examines Lawvere's abstract, category-theoretic proof of the fixed-point theorem whose contrapositive is a `universal' diagonal argument. The main result is that the necessary axioms for both the fixed-point theorem and the diagonal argument can be stripped back further, to a semantic analogue of a weak substructural logic lacking weakening or exchange.

    Submitted 9 August, 2023; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: v1 20 pages; v2 22 pages, added additional final section on fixed-point operators; v3 final journal version

    MSC Class: 03B47; 18A15 ACM Class: F.4.1

    Journal ref: Compositionality 5, 8 (2023)

  34. arXiv:2109.00908  [pdf, ps, other

    cs.IT math.CO

    Binary self-dual codes of various lengths with new weight enumerators from a modified bordered construction and neighbours

    Authors: Joe Gildea, Adrian Korban, Adam Michael Roberts, Alexander Tylyshchak

    Abstract: In this work, we define a modification of a bordered construction for self-dual codes which utilises $λ$-circulant matrices. We provide the necessary conditions for the construction to produce self-dual codes over finite commutative Frobenius rings of characteristic 2. Using the modified construction together with the neighbour construction, we construct many binary self-dual codes of lengths 54,… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2108.09184, arXiv:2106.12355, arXiv:2102.10354

    MSC Class: 94B05; 15B10; 15B33

  35. arXiv:2109.00725  [pdf, other

    cs.CL cs.LG

    Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

    Authors: Amir Feder, Katherine A. Keith, Emaad Manzoor, Reid Pryzant, Dhanya Sridhar, Zach Wood-Doughty, Jacob Eisenstein, Justin Grimmer, Roi Reichart, Margaret E. Roberts, Brandon M. Stewart, Victor Veitch, Diyi Yang

    Abstract: A fundamental goal of scientific research is to learn about causal relationships. However, despite its critical role in the life and social sciences, causality has not had the same importance in Natural Language Processing (NLP), which has traditionally placed more emphasis on predictive tasks. This distinction is beginning to fade, with an emerging area of interdisciplinary research at the conver… ▽ More

    Submitted 30 July, 2022; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: Accepted to Transactions of the Association for Computational Linguistics (TACL)

  36. New binary self-dual codes of lengths 56, 62, 78, 92 and 94 from a bordered construction

    Authors: Joe Gildea, Adrian Korban, Adam Michael Roberts, Alexander Tylyshchak

    Abstract: In this paper, we present a new bordered construction for self-dual codes which employs $λ$-circulant matrices. We give the necessary conditions for our construction to produce self-dual codes over a finite commutative Frobenius ring of characteristic 2. Moreover, using our bordered construction together with the well-known building-up and neighbour methods, we construct many binary self-dual code… ▽ More

    Submitted 3 February, 2022; v1 submitted 20 August, 2021; originally announced August 2021.

    Comments: corrected typos; other minor corrections. arXiv admin note: substantial text overlap with arXiv:2102.10354, arXiv:2106.12355, arXiv:2102.12326

    MSC Class: 94B05; 15B10; 15B33

  37. arXiv:2108.05056  [pdf, ps, other

    cs.IT

    Group LCD and Group Reversible LCD Codes

    Authors: Steven T. Dougherty, Joe Gildea, Adrian Korban, Adam M. Roberts

    Abstract: In this paper, we give a new method for constructing LCD codes. We employ group rings and a well known map that sends group ring elements to a subring of the $n \times n$ matrices to obtain LCD codes. Our construction method guarantees that our LCD codes are also group codes, namely, the codes are ideals in a group ring. We show that with a certain condition on the group ring element $v,$ one can… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

    Comments: 17 pages

    MSC Class: 94B05

  38. New binary self-dual codes of lengths 80, 84 and 96 from composite matrices

    Authors: Joe Gildea, Adrian Korban, Adam Michael Roberts

    Abstract: In this work, we apply the idea of composite matrices arising from group rings to derive a number of different techniques for constructing self-dual codes over finite commutative Frobenius rings. By applying these techniques over different alphabets, we construct best known singly-even binary self-dual codes of lengths 80, 84 and 96 as well as doubly-even binary self-dual codes of length 96 that w… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: arXiv admin note: text overlap with arXiv:2102.10354

    MSC Class: 94B05; 16S34; 15B10; 15B33

  39. Quaternary Hermitian self-dual codes of lengths 26, 32, 36, 38 and 40 from modifications of well-known circulant constructions

    Authors: Adam Michael Roberts

    Abstract: In this work, we give three new techniques for constructing Hermitian self-dual codes over commutative Frobenius rings with a non-trivial involutory automorphism using $λ$-circulant matrices. The new constructions are derived as modifications of various well-known circulant constructions of self-dual codes. Applying these constructions together with the building-up construction, we construct many… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.10354

  40. New binary self-dual codes of lengths 56, 58, 64, 80 and 92 from a modification of the four circulant construction

    Authors: Joe Gildea, Adrian Korban, Adam Michael Roberts

    Abstract: In this work, we give a new technique for constructing self-dual codes over commutative Frobenius rings using $λ$-circulant matrices. The new construction was derived as a modification of the well-known four circulant construction of self-dual codes. Applying this technique together with the building-up construction, we construct singly-even binary self-dual codes of lengths 56, 58, 64, 80 and 92… ▽ More

    Submitted 23 June, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: corrected typos; added references

    MSC Class: 94B05; 15B10; 15B33

  41. arXiv:2102.09667  [pdf, other

    cs.LO

    Semantics and Axiomatization for Stochastic Differential Dynamic Logic

    Authors: Michael Roberts, Alexei Kopylov, Aleksey Nogin

    Abstract: Building on previous work by André Platzer, we present a formal language for Stochastic Differential Dynamic Logic, and define its semantics, axioms and inference rules. Compared to the previous effort, our account of the Stochastic Differential Dynamic Logic follows closer to and is more compatible with the traditional account of the regular Differential Dynamic Logic. We resolve an issue with th… ▽ More

    Submitted 28 April, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

  42. arXiv:2101.09294  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Censorship of Online Encyclopedias: Implications for NLP Models

    Authors: Eddie Yang, Margaret E. Roberts

    Abstract: While artificial intelligence provides the backbone for many tools people use around the world, recent work has brought to attention that the algorithms powering AI are not free of politics, stereotypes, and bias. While most work in this area has focused on the ways in which AI can exacerbate existing inequalities and discrimination, very little work has studied how governments actively shape trai… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: Accepted for publication at ACM FAccT 2021

  43. arXiv:2011.13171  [pdf, ps, other

    cs.LO

    Universal Semantics for the Stochastic Lambda-Calculus

    Authors: Pedro Amorim, Dexter Kozen, Radu Mardare, Prakash Panangaden, Michael Roberts

    Abstract: We define sound and adequate denotational and operational semantics for the stochastic lambda calculus. These two semantic approaches build on previous work that used similar techniques to reason about higher-order probabilistic programs, but for the first time admit an adequacy theorem relating the operational and denotational views. This resolves the main issue left open in (Bacci et al. 2018).

    Submitted 14 May, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: 14 pages

  44. arXiv:2011.02523  [pdf, other

    cs.CV cs.GR

    Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

    Authors: Mike Roberts, Jason Ramapuram, Anurag Ranjan, Atulit Kumar, Miguel Angel Bautista, Nathan Paczan, Russ Webb, Joshua M. Susskind

    Abstract: For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-pixel ground truth labels from real images. We address this challenge by introducing Hypersim, a photorealistic synthetic dataset for holistic indoor scene understanding. To create our dataset, we leverage a large repository of synthetic scenes created by professional artists, and we generate 77,400 images… ▽ More

    Submitted 17 August, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

    Comments: Accepted for publication at the International Conference on Computer Vision (ICCV) 2021

  45. arXiv:2008.06388  [pdf

    cs.LG cs.CV eess.IV stat.ML

    Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans

    Authors: Michael Roberts, Derek Driggs, Matthew Thorpe, Julian Gilbey, Michael Yeung, Stephan Ursprung, Angelica I. Aviles-Rivero, Christian Etmann, Cathal McCague, Lucian Beer, Jonathan R. Weir-McCall, Zhongzhao Teng, Effrossyni Gkrania-Klotsas, James H. F. Rudd, Evis Sala, Carola-Bibiane Schönlieb

    Abstract: Machine learning methods offer great promise for fast and accurate detection and prognostication of COVID-19 from standard-of-care chest radiographs (CXR) and computed tomography (CT) images. Many articles have been published in 2020 describing new machine learning-based models for both of these tasks, but it is unclear which are of potential clinical utility. In this systematic review, we search… ▽ More

    Submitted 5 January, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: 35 pages, 3 figures, 2 tables, updated to the period 1 January 2020 - 3 October 2020

    Journal ref: Nature Machine Intelligence 3, 199-217 (2021)

  46. arXiv:2005.07641  [pdf, other

    cs.NI

    Watching the Watchers: Nonce-based Inverse Surveillance to Remotely Detect Monitoring

    Authors: Laura M. Roberts, David Plonka

    Abstract: Internet users and service providers do not often know when traffic is being watched but desire a way to determine when, where, and by whom. We present NOISE, the Nonce Observatory for Inverse Surveillance of Eavesdroppers, a method and system that detects monitoring by disseminating nonces - unique, pseudorandom values - in traffic and seeing if they are acted upon unexpectedly, indicating that t… ▽ More

    Submitted 5 June, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

  47. arXiv:2002.11834  [pdf, other

    cs.HC

    Understanding How and Why University Students Use Virtual Private Networks

    Authors: Agnieszka Dutkowska-Zuk, Austin Hounsel, Andre Xiong, Molly Roberts, Brandon Stewart, Marshini Chetty, Nick Feamster

    Abstract: We study how and why university students chose and use VPNs, and whether they are aware of the security and privacy risks that VPNs pose. To answer these questions, we conducted 32 in-person interviews and a survey with 349 respondents, all university students in the United States. We find students are mostly concerned with access to content and privacy concerns were often secondary. They made tra… ▽ More

    Submitted 22 February, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: Interview guide, interview summary codebook, survey questions, and additional survey figures included in the appendix document

  48. arXiv:1910.07494  [pdf, other

    cs.CY

    On Constructing a Knowledge Base of Chinese Criminal Cases

    Authors: Xiaohan Wu, Benjamin L. Liebman, Rachel E. Stern, Margaret E. Roberts, Amarnath Gupta

    Abstract: We are developing a knowledge base over Chinese judicial decision documents to facilitate landscape analyses of Chinese Criminal Cases. We view judicial decision documents as a mixed-granularity semi-structured text where different levels of the text carry different semantic constructs and entailments. We use a combination of context-sensitive grammar, dependency parsing and discourse analysis to… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: submitted to JURIX 2019

  49. Chan-Vese Reformulation for Selective Image Segmentation

    Authors: Michael Roberts, Jack Spencer

    Abstract: Selective segmentation involves incorporating user input to partition an image into foreground and background, by discriminating between objects of a similar type. Typically, such methods involve introducing additional constraints to generic segmentation approaches. However, we show that this is often inconsistent with respect to common assumptions about the image. The proposed method introduces a… ▽ More

    Submitted 5 July, 2019; v1 submitted 21 November, 2018; originally announced November 2018.

    Comments: To appear in the Journal of Mathematical Imaging and Vision 2019. (23 pages, 19 figures)

  50. arXiv:1810.01488  [pdf, other

    eess.SP cs.LG physics.data-an physics.geo-ph stat.ML

    Using Machine Learning to Discern Eruption in Noisy Environments: A Case Study using CO2-driven Cold-Water Geyser in Chimayo, New Mexico

    Authors: B. Yuan, Y. J. Tan, M. K. Mudunuru, O. E. Marcillo, A. A. Delorey, P. M. Roberts, J. D. Webster, C. N. L. Gammans, S. Karra, G. D. Guthrie, P. A. Johnson

    Abstract: We present an approach based on machine learning (ML) to distinguish eruption and precursory signals of Chimayó geyser (New Mexico, USA) under noisy environments. This geyser can be considered as a natural analog of $\mathrm{CO}_2$ intrusion into shallow water aquifers. By studying this geyser, we can understand upwelling of $\mathrm{CO}_2$-rich fluids from depth, which has relevance to leak monit… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

    Comments: 16 pages,7 figures