Showing 1–17 of 17 results for author: Arasteh, S T

  1. arXiv:2404.08064  [pdf]

    eess.AS cs.AI cs.CR cs.LG

    The Impact of Speech Anonymization on Pathology and Its Limits

    Authors: Soroosh Tayebi Arasteh, Tomas Arias-Vergara, Paula Andrea Perez-Toro, Tobias Weise, Kai Packhaeuser, Maria Schuster, Elmar Noeth, Andreas Maier, Seung Hee Yang

    Abstract: Integration of speech into healthcare has intensified privacy concerns due to its potential as a non-invasive biomarker containing individual biometric information. In response, speaker anonymization aims to conceal personally identifiable information while retaining crucial linguistic content. However, the application of anonymization techniques to pathological speech, a critical area where priva…

    Submitted 22 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2310.00757  [pdf]

    cs.CV cs.AI cs.LG eess.IV

    Mind the Gap: Federated Learning Broadens Domain Generalization in Diagnostic AI Models

    Authors: Soroosh Tayebi Arasteh, Christiane Kuhl, Marwin-Jonathan Saehn, Peter Isfort, Daniel Truhn, Sven Nebelung

    Abstract: Developing robust artificial intelligence (AI) models that generalize well to unseen datasets is challenging and usually requires large and variable datasets, preferably from multiple institutions. In federated learning (FL), a model is trained collaboratively at numerous sites that hold local datasets without exchanging them. So far, the impact of training strategy, i.e., local versus collaborati…

    Submitted 19 December, 2023; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: Published in Nature Scientific Reports

    Journal ref: Sci Rep 13, 22576 (2023)

  3. arXiv:2308.14120  [pdf]

    cs.LG cs.AI cs.CL

    Large Language Models Streamline Automated Machine Learning for Clinical Studies

    Authors: Soroosh Tayebi Arasteh, Tianyu Han, Mahshad Lotfinia, Christiane Kuhl, Jakob Nikolas Kather, Daniel Truhn, Sven Nebelung

    Abstract: A knowledge gap persists between machine learning (ML) developers (e.g., data scientists) and practitioners (e.g., clinicians), hampering the full utilization of ML for clinical data analysis. We investigated the potential of the ChatGPT Advanced Data Analysis (ADA), an extension of GPT-4, to bridge this gap and perform ML analyses efficiently. Real-world clinical datasets and study details from l…

    Submitted 21 February, 2024; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: Published in Nature Communications

    Journal ref: Nat Commun 15, 1603 (2024)

  4. arXiv:2308.07688  [pdf]

    eess.IV cs.CV cs.LG

    Enhancing Network Initialization for Medical AI Models Using Large-Scale, Unlabeled Natural Images

    Authors: Soroosh Tayebi Arasteh, Leo Misera, Jakob Nikolas Kather, Daniel Truhn, Sven Nebelung

    Abstract: Pre-training datasets, like ImageNet, have become the gold standard in medical image analysis. However, the emergence of self-supervised learning (SSL), which leverages unlabeled data to learn robust features, presents an opportunity to bypass the intensive labeling process. In this study, we explored if SSL for pre-training on non-medical images can be applied to chest radiographs and how it comp…

    Submitted 8 February, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Published in European Radiology Experimental

    Journal ref: Eur Radiol Exp 8, 10 (2024)

  5. arXiv:2306.06503  [pdf]

    cs.LG cs.AI cs.CR eess.IV

    Preserving privacy in domain transfer of medical AI models comes at no performance costs: The integral role of differential privacy

    Authors: Soroosh Tayebi Arasteh, Mahshad Lotfinia, Teresa Nolte, Marwin Saehn, Peter Isfort, Christiane Kuhl, Sven Nebelung, Georgios Kaissis, Daniel Truhn

    Abstract: Developing robust and effective artificial intelligence (AI) models in medicine requires access to large amounts of patient data. The use of AI models solely trained on large multi-institutional datasets can help with this, yet the imperative to ensure data privacy remains, particularly as membership inference risks breaching patient confidentiality. As a proposed remedy, we advocate for the integ…

    Submitted 7 December, 2023; v1 submitted 10 June, 2023; originally announced June 2023.

    Comments: Published in Radiology: Artificial Intelligence. RSNA

    Journal ref: Radiology: Artificial Intelligence, 2024, 6(1), e230212

  6. Federated learning for secure development of AI models for Parkinson's disease detection using speech from different languages

    Authors: Soroosh Tayebi Arasteh, Cristian David Rios-Urrego, Elmar Noeth, Andreas Maier, Seung Hee Yang, Jan Rusz, Juan Rafael Orozco-Arroyave

    Abstract: Parkinson's disease (PD) is a neurological disorder impacting a person's speech. Among automatic PD assessment methods, deep learning models have gained particular interest. Recently, the community has explored cross-pathology and cross-language models which can improve diagnostic accuracy even further. However, strict patient data privacy regulations largely prevent institutions from sharing pati…

    Submitted 21 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: INTERSPEECH 2023, pp. 5003--5007, Dublin, Ireland

    Journal ref: INTERSPEECH 2023

  7. Fibroglandular Tissue Segmentation in Breast MRI using Vision Transformers -- A multi-institutional evaluation

    Authors: Gustav Müller-Franzes, Fritz Müller-Franzes, Luisa Huck, Vanessa Raaff, Eva Kemmer, Firas Khader, Soroosh Tayebi Arasteh, Teresa Nolte, Jakob Nikolas Kather, Sven Nebelung, Christiane Kuhl, Daniel Truhn

    Abstract: Accurate and automatic segmentation of fibroglandular tissue in breast MRI screening is essential for the quantification of breast density and background parenchymal enhancement. In this retrospective study, we developed and evaluated a transformer-based neural network for breast segmentation (TraBS) in multi-institutional MRI data, and compared its performance to the well established convolutiona…

    Submitted 18 April, 2023; originally announced April 2023.

    Journal ref: Sci Rep 13, 14207 (2023)

  8. arXiv:2302.01622  [pdf, other]

    eess.IV cs.AI cs.CR cs.CV cs.LG

    Private, fair and accurate: Training large-scale, privacy-preserving AI models in medical imaging

    Authors: Soroosh Tayebi Arasteh, Alexander Ziller, Christiane Kuhl, Marcus Makowski, Sven Nebelung, Rickmer Braren, Daniel Rueckert, Daniel Truhn, Georgios Kaissis

    Abstract: Artificial intelligence (AI) models are increasingly used in the medical domain. However, as medical data is highly sensitive, special precautions to ensure its protection are required. The gold standard for privacy preservation is the introduction of differential privacy (DP) to model training. Prior work indicates that DP has negative implications on model accuracy and fairness, which are unacce…

    Submitted 16 March, 2024; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Published in Communications Medicine. Nature Portfolio

    Journal ref: Commun Med 4(1), 46 (2024)

  9. arXiv:2212.09162  [pdf]

    cs.LG cs.AI

    Medical Diagnosis with Large Scale Multimodal Transformers: Leveraging Diverse Data for More Accurate Diagnosis

    Authors: Firas Khader, Gustav Mueller-Franzes, Tianci Wang, Tianyu Han, Soroosh Tayebi Arasteh, Christoph Haarburger, Johannes Stegmaier, Keno Bressem, Christiane Kuhl, Sven Nebelung, Jakob Nikolas Kather, Daniel Truhn

    Abstract: Multimodal deep learning has been used to predict clinical endpoints and diagnoses from clinical routine data. However, these models suffer from scaling issues: they have to learn pairwise interactions between each piece of information in each data type, thereby escalating model complexity beyond manageable scales. This has so far precluded a widespread use of multimodal deep learning. Here, we pr…

    Submitted 20 December, 2022; v1 submitted 18 December, 2022; originally announced December 2022.

  10. Diffusion Probabilistic Models beat GANs on Medical Images

    Authors: Gustav Müller-Franzes, Jan Moritz Niehues, Firas Khader, Soroosh Tayebi Arasteh, Christoph Haarburger, Christiane Kuhl, Tianci Wang, Tianyu Han, Sven Nebelung, Jakob Nikolas Kather, Daniel Truhn

    Abstract: The success of Deep Learning applications critically depends on the quality and scale of the underlying training data. Generative adversarial networks (GANs) can generate arbitrarily large datasets, but diversity and fidelity are limited, which has recently been addressed by denoising diffusion probabilistic models (DDPMs) whose superiority has been demonstrated on natural images. In this study, we…

    Submitted 14 December, 2022; originally announced December 2022.

    Journal ref: Sci Rep 13, 12098 (2023)

  11. arXiv:2211.13606  [pdf]

    cs.LG cs.AI eess.IV

    Collaborative Training of Medical Artificial Intelligence Models with non-uniform Labels

    Authors: Soroosh Tayebi Arasteh, Peter Isfort, Marwin Saehn, Gustav Mueller-Franzes, Firas Khader, Jakob Nikolas Kather, Christiane Kuhl, Sven Nebelung, Daniel Truhn

    Abstract: Due to the rapid advancements in recent years, medical image analysis is largely dominated by deep learning (DL). However, building powerful and robust DL models requires training with large multi-party datasets. While multiple stakeholders have provided publicly available datasets, the ways in which these data are labeled vary widely. For instance, an institution might provide a dataset of chest…

    Submitted 13 April, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: Published in Nature Scientific Reports

    Journal ref: Sci Rep 13, 6046 (2023)

  12. arXiv:2211.03364  [pdf, other]

    eess.IV cs.CV cs.LG

    Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image Generation

    Authors: Firas Khader, Gustav Mueller-Franzes, Soroosh Tayebi Arasteh, Tianyu Han, Christoph Haarburger, Maximilian Schulze-Hagen, Philipp Schad, Sandy Engelhardt, Bettina Baessler, Sebastian Foersch, Johannes Stegmaier, Christiane Kuhl, Sven Nebelung, Jakob Nikolas Kather, Daniel Truhn

    Abstract: Recent advances in computer vision have shown promising results in image generation. Diffusion probabilistic models in particular have generated realistic images from textual input, as demonstrated by DALL-E 2, Imagen and Stable Diffusion. However, their use in medicine, where image data typically comprises three-dimensional volumes, has not been systematically evaluated. Synthetic images may play…

    Submitted 3 January, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

  13. arXiv:2204.06450  [pdf, other]

    cs.SD cs.LG eess.AS

    The effect of speech pathology on automatic speaker verification -- a large-scale study

    Authors: Soroosh Tayebi Arasteh, Tobias Weise, Maria Schuster, Elmar Noeth, Andreas Maier, Seung Hee Yang

    Abstract: Navigating the challenges of data-driven speech processing, one of the primary hurdles is accessing reliable pathological speech data. While public datasets appear to offer solutions, they come with inherent risks of potential unintended exposure of patient health information via re-identification attacks. Using a comprehensive real-world pathological speech corpus, with over n=3,800 test subjects…

    Submitted 22 November, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Published in Scientific Reports

    Journal ref: Sci Rep 13, 20476 (2023)

  14. How Will Your Tweet Be Received? Predicting the Sentiment Polarity of Tweet Replies

    Authors: Soroosh Tayebi Arasteh, Mehrpad Monajem, Vincent Christlein, Philipp Heinrich, Anguelos Nicolaou, Hamidreza Naderi Boldaji, Mahshad Lotfinia, Stefan Evert

    Abstract: Twitter sentiment analysis, which often focuses on predicting the polarity of tweets, has attracted increasing attention over the last years, in particular with the rise of deep learning (DL). In this paper, we propose a new task: predicting the predominant sentiment among (first-order) replies to a given tweet. Therefore, we created RETWEET, a large dataset of tweets and replies manually annotate…

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: Published in 2021 IEEE 15th International Conference on Semantic Computing (ICSC)

    Journal ref: 2021 IEEE 15th International Conference on Semantic Computing (ICSC), Laguna Hills, CA, USA, 2021, pp. 356-359

  15. arXiv:2102.02470  [pdf, other]

    cs.LG cond-mat.mtrl-sci

    Machine Learning-Based Generalized Model for Finite Element Analysis of Roll Deflection During the Austenitic Stainless Steel 316L Strip Rolling

    Authors: Mahshad Lotfinia, Soroosh Tayebi Arasteh

    Abstract: During the strip rolling process, a considerable amount of the forces of the material pressure cause elastic deformation on the work-roll, i.e., the deflection process. The uncontrollable amount of the work-roll deflection leads to the high deviations in the permissible thickness of the plate along its width. In the context of the Austenitic Stainless Steels (ASS), due to the instability of the Au…

    Submitted 24 April, 2022; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: 11 pages, 8 figures

  16. arXiv:2011.08232  [pdf, other]

    cs.GR cs.CG math.AG

    Conversion Between Cubic Bezier Curves and Catmull-Rom Splines

    Authors: Soroosh Tayebi Arasteh, Adam Kalisz

    Abstract: Splines are one of the main methods of mathematically representing complicated shapes, which have become the primary technique in the fields of Computer Graphics (CG) and Computer-Aided Geometric Design (CAGD) for modeling complex surfaces. Among all, Bézier and Catmull-Rom splines are the most common in the sub-fields of engineering. In this paper, we focus on conversion between cubic Bézier and…

    Submitted 31 July, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: Published in SN Computer Science

    Journal ref: SN COMPUT. SCI. 2, 398 (2021)

  17. arXiv:2011.04896  [pdf, ps, other]

    eess.AS cs.AI cs.CL cs.LG

    An Empirical Study on Text-Independent Speaker Verification based on the GE2E Method

    Authors: Soroosh Tayebi Arasteh

    Abstract: While many researchers in the speaker recognition area have started to replace the former classical state-of-the-art methods with deep learning techniques, some of the traditional i-vector-based methods are still state-of-the-art in the context of text-independent speaker verification. Google's Generalized End-to-End Loss for Speaker Verification (GE2E), a deep learning-based technique using long…

    Submitted 27 February, 2022; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 6 pages, 7 tables, 2 figures, 4 algorithms. An empirical study on the paper arXiv:1710.10467 by Wan et al. (2017)