Skip to main content

Showing 1–7 of 7 results for author: Shiraishi, T

  1. arXiv:2406.18902  [pdf, other

    stat.ML cs.LG

    Statistical Test for Data Analysis Pipeline by Selective Inference

    Authors: Tomohiro Shiraishi, Tatsuya Matsukawa, Shuichi Nishino, Ichiro Takeuchi

    Abstract: A data analysis pipeline is a structured sequence of processing steps that transforms raw data into meaningful insights by effectively integrating various analysis algorithms. In this paper, we propose a novel statistical test designed to assess the statistical significance of data analysis pipelines. Our approach allows for the systematic development of valid statistical tests applicable to any d… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2402.11789  [pdf, other

    stat.ML cs.CV cs.LG

    Statistical Test for Generated Hypotheses by Diffusion Models

    Authors: Teruyuki Katsuoka, Tomohiro Shiraishi, Daiki Miwa, Vo Nguyen Le Duy, Ichiro Takeuchi

    Abstract: The enhanced performance of AI has accelerated its integration into scientific research. In particular, the use of generative AI to create scientific hypotheses is promising and is increasingly being applied across various fields. However, when employing AI-generated hypotheses for critical decisions, such as medical diagnoses, verifying their reliability is crucial. In this study, we consider a m… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 32pages, 6figures

  3. arXiv:2402.03724  [pdf, other

    stat.ML cs.LG

    Statistical Test for Anomaly Detections by Variational Auto-Encoders

    Authors: Daiki Miwa, Tomohiro Shiraishi, Vo Nguyen Le Duy, Teruyuki Katsuoka, Ichiro Takeuchi

    Abstract: In this study, we consider the reliability assessment of anomaly detection (AD) using Variational Autoencoder (VAE). Over the last decade, VAE-based AD has been actively studied in various perspective, from method development to applied research. However, when the results of ADs are used in high-stakes decision-making, such as in medical diagnosis, it is necessary to ensure the reliability of the… ▽ More

    Submitted 2 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  4. arXiv:2401.08169  [pdf, other

    stat.ML cs.LG

    Statistical Test for Attention Map in Vision Transformer

    Authors: Tomohiro Shiraishi, Daiki Miwa, Teruyuki Katsuoka, Vo Nguyen Le Duy, Kouichi Taji, Ichiro Takeuchi

    Abstract: The Vision Transformer (ViT) demonstrates exceptional performance in various computer vision tasks. Attention is crucial for ViT to capture complex wide-ranging relationships among image patches, allowing the model to weigh the importance of image patches and aiding our understanding of the decision-making process. However, when utilizing the attention of ViT as evidence in high-stakes decision-ma… ▽ More

    Submitted 19 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 42pages, 17figures

  5. arXiv:2311.14964  [pdf, other

    stat.ML cs.LG

    Selective Inference for Changepoint detection by Recurrent Neural Network

    Authors: Tomohiro Shiraishi, Daiki Miwa, Vo Nguyen Le Duy, Ichiro Takeuchi

    Abstract: In this study, we investigate the quantification of the statistical reliability of detected change points (CPs) in time series using a Recurrent Neural Network (RNN). Thanks to its flexibility, RNN holds the potential to effectively identify CPs in time series characterized by complex dynamics. However, there is an increased risk of erroneously detecting random noise fluctuations as CPs. The prima… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: 41pages, 16figures

  6. arXiv:2307.11351  [pdf, other

    stat.ML cs.LG

    Bounded P-values in Parametric Programming-based Selective Inference

    Authors: Tomohiro Shiraishi, Daiki Miwa, Vo Nguyen Le Duy, Ichiro Takeuchi

    Abstract: Selective inference (SI) has been actively studied as a promising framework for statistical hypothesis testing for data-driven hypotheses. The basic idea of SI is to make inferences conditional on an event that a hypothesis is selected. In order to perform SI, this event must be characterized in a traceable form. When selection event is too difficult to characterize, additional conditions are intr… ▽ More

    Submitted 28 December, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: 48pages, 14figures

  7. arXiv:1902.09722  [pdf, other

    cs.LG stat.ML

    Topological Bayesian Optimization with Persistence Diagrams

    Authors: Tatsuya Shiraishi, Tam Le, Hisashi Kashima, Makoto Yamada

    Abstract: Finding an optimal parameter of a black-box function is important for searching stable material structures and finding optimal neural network structures, and Bayesian optimization algorithms are widely used for the purpose. However, most of existing Bayesian optimization algorithms can only handle vector data and cannot handle complex structured data. In this paper, we propose the topological Baye… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.