Showing 1–5 of 5 results for author: Fürst, A

  1. arXiv:2402.12365  [pdf, other]

    cs.LG cs.AI physics.flu-dyn

    Universal Physics Transformers: A Framework For Efficiently Scaling Neural Operators

    Authors: Benedikt Alkin, Andreas Fürst, Simon Schmid, Lukas Gruber, Markus Holzleitner, Johannes Brandstetter

    Abstract: Neural operators, serving as physics surrogate models, have recently gained increased interest. With ever increasing problem complexity, the natural question arises: what is an efficient way to scale neural operators to larger and more complex simulations - most importantly by taking into account different types of simulation datasets. This is of special interest since, akin to their numerical cou…

    Submitted 30 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  2. arXiv:2304.10520  [pdf, other]

    cs.CV cs.AI cs.LG

    Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget

    Authors: Johannes Lehner, Benedikt Alkin, Andreas Fürst, Elisabeth Rumetshofer, Lukas Miklautz, Sepp Hochreiter

    Abstract: Masked Image Modeling (MIM) methods, like Masked Autoencoders (MAE), efficiently learn a rich representation of the input. However, for adapting to downstream tasks, they require a sufficient amount of labeled data since their rich features code not only objects but also less relevant image background. In contrast, Instance Discrimination (ID) methods focus on objects. In this work, we study how t…

    Submitted 14 September, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  3. arXiv:2110.11316  [pdf, other]

    cs.LG cs.CV

    CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

    Authors: Andreas Fürst, Elisabeth Rumetshofer, Johannes Lehner, Viet Tran, Fei Tang, Hubert Ramsauer, David Kreil, Michael Kopp, Günter Klambauer, Angela Bitto-Nemling, Sepp Hochreiter

    Abstract: CLIP yielded impressive results on zero-shot transfer learning tasks and is considered as a foundation model like BERT or GPT3. CLIP vision models that have a rich representation are pre-trained using the InfoNCE objective and natural language supervision before they are fine-tuned on particular tasks. Though CLIP excels at zero-shot transfer learning, it suffers from an explaining away problem, t…

    Submitted 7 November, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2022; Blog: https://ml-jku.github.io/cloob; GitHub: https://github.com/ml-jku/cloob

  4. Cloud-scale VM Deflation for Running Interactive Applications On Transient Servers

    Authors: Alexander Fuerst, Ahmed Ali-Eldin, Prashant Shenoy, Prateek Sharma

    Abstract: Transient computing has become popular in public cloud environments for running delay-insensitive batch and data processing applications at low cost. Since transient cloud servers can be revoked at any time by the cloud provider, they are considered unsuitable for running interactive applications such as web services. In this paper, we present VM deflation as an alternative mechanism to server pree…

    Submitted 31 May, 2020; originally announced June 2020.

    Comments: To appear at ACM HPDC 2020

  5. arXiv:1210.7283  [pdf, other]

    cs.SE

    Abstract Data Types in Event-B - An Application of Generic Instantiation

    Authors: David Basin, Andreas Fürst, Thai Son Hoang, Kunihiko Miyazaki, Naoto Sato

    Abstract: Integrating formal methods into industrial practice is a challenging task. Often, different kinds of expertise are required within the same development. On the one hand, there are domain engineers who have specific knowledge of the system under development. On the other hand, there are formal methods experts who have experience in rigorously specifying and reasoning about formal systems. Coordinat…

    Submitted 26 October, 2012; originally announced October 2012.

    Comments: In Proceedings of DS-Event-B 2012: Workshop on the experience of and advances in developing dependable systems in Event-B, in conjunction with ICFEM 2012 - Kyoto, Japan, November 13, 2012