Skip to main content

Showing 1–50 of 71 results for author: Fischer, T

  1. arXiv:2407.10791  [pdf, other

    cs.HC

    Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent

    Authors: Yannick Metz, Dennis Ackermann, Daniel A. Keim, Maximilian T. Fischer

    Abstract: Efficient public transport systems are crucial for sustainable urban development as cities face increasing mobility demands. Yet, many public transport networks struggle to meet diverse user needs due to historical development, urban constraints, and financial limitations. Traditionally, planning of transport network structure is often based on limited surveys, expert opinions, or partial usage st… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 9 pages, 8 figures

    ACM Class: H.5.2

  2. arXiv:2407.10652  [pdf, other

    cs.LG cs.DL cs.HC

    Cutting Through the Clutter: The Potential of LLMs for Efficient Filtration in Systematic Literature Reviews

    Authors: Lucas Joos, Daniel A. Keim, Maximilian T. Fischer

    Abstract: In academic research, systematic literature reviews are foundational and highly relevant, yet tedious to create due to the high volume of publications and labor-intensive processes involved. Systematic selection of relevant papers through conventional means like keyword-based filtering techniques can sometimes be inadequate, plagued by semantic ambiguities and inconsistent terminology, which can l… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 5 pages, 5 figures, 1 table

    ACM Class: H.5.2

  3. arXiv:2407.09271  [pdf, other

    cs.CV cs.LG

    iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning

    Authors: Tom Fischer, Yaoyao Liu, Artur Jesslen, Noor Ahmed, Prakhar Kaushik, Angtian Wang, Alan Yuille, Adam Kortylewski, Eddy Ilg

    Abstract: Different from human nature, it is still common practice today for vision tasks to train deep learning models only initially and on fixed datasets. A variety of approaches have recently addressed handling continual data streams. However, extending these methods to manage out-of-distribution (OOD) scenarios has not effectively been investigated. On the other hand, it has recently been shown that no… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  4. arXiv:2407.05427  [pdf, other

    cs.HC cs.IR

    MelodyVis: Visual Analytics for Melodic Patterns in Sheet Music

    Authors: Matthias Miller, Daniel Fürst, Maximilian T. Fischer, Hanna Hauptmann, Daniel Keim, Mennatallah El-Assady

    Abstract: Manual melody detection is a tedious task requiring high expertise level, while automatic detection is often not expressive or powerful enough. Thus, we present MelodyVis, a visual application designed in collaboration with musicology experts to explore melodic patterns in digital sheet music. MelodyVis features five connected views, including a Melody Operator Graph and a Voicing Timeline. The sy… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 9+2 pages, 9 figures, preprint, originally submitted to IEEE VIS 23, revision

    ACM Class: I.5.4; H.3.3; J.5.7

  5. arXiv:2406.19543  [pdf, other

    cs.CL cs.SI

    Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management

    Authors: Seid Muhie Yimam, Daryna Dementieva, Tim Fischer, Daniil Moskovskiy, Naquee Rizwan, Punyajoy Saha, Sarthak Roy, Martin Semmann, Alexander Panchenko, Chris Biemann, Animesh Mukherjee

    Abstract: Despite regulations imposed by nations and social media platforms, such as recent EU regulations targeting digital violence, abusive content persists as a significant challenge. Existing approaches primarily rely on binary solutions, such as outright blocking or banning, yet fail to address the complex nature of abusive speech. In this work, we propose a more comprehensive approach called Demarcat… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  6. arXiv:2406.15068  [pdf, other

    cs.AR

    Occamy: A 432-Core 28.1 DP-GFLOP/s/W 83% FPU Utilization Dual-Chiplet, Dual-HBM2E RISC-V-based Accelerator for Stencil and Sparse Linear Algebra Computations with 8-to-64-bit Floating-Point Support in 12nm FinFET

    Authors: Gianna Paulin, Paul Scheffler, Thomas Benz, Matheus Cavalcante, Tim Fischer, Manuel Eggimann, Yichao Zhang, Nils Wistoff, Luca Bertaccini, Luca Colagrande, Gianmarco Ottavi, Frank K. Gürkaynak, Davide Rossi, Luca Benini

    Abstract: We present Occamy, a 432-core RISC-V dual-chiplet 2.5D system for efficient sparse linear algebra and stencil computations on FP64 and narrow (32-, 16-, 8-bit) SIMD FP data. Occamy features 48 clusters of RISC-V cores with custom extensions, two 64-bit host cores, and a latency-tolerant multi-chiplet interconnect and memory system with 32 GiB of HBM2E. It achieves leading-edge utilization on stenc… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 2 pages, 7 figures. Accepted at the 2024 IEEE Symposium on VLSI Technology & Circuits

  7. arXiv:2406.03175  [pdf, other

    cs.CV

    Dynamic 3D Gaussian Fields for Urban Areas

    Authors: Tobias Fischer, Jonas Kulhanek, Samuel Rota Bulò, Lorenzo Porzi, Marc Pollefeys, Peter Kontschieder

    Abstract: We present an efficient neural 3D scene representation for novel-view synthesis (NVS) in large-scale, dynamic urban areas. Existing works are not well suited for applications like mixed-reality or closed-loop simulation due to their limited visual quality and non-interactive rendering speeds. Recently, rasterization-based approaches have achieved high-quality NVS at impressive speeds. However, the… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Project page is available at https://tobiasfshr.github.io/pub/4dgf/

  8. arXiv:2405.19284  [pdf, other

    cs.DC cs.AI cs.AR

    Optimizing Foundation Model Inference on a Many-tiny-core Open-source RISC-V Platform

    Authors: Viviane Potocnik, Luca Colagrande, Tim Fischer, Luca Bertaccini, Daniele Jahier Pagliari, Alessio Burrello, Luca Benini

    Abstract: Transformer-based foundation models have become crucial for various domains, most notably natural language processing (NLP) or computer vision (CV). These models are predominantly deployed on high-performance GPUs or hardwired accelerators with highly customized, proprietary instruction sets. Until now, limited attention has been given to RISC-V-based general-purpose platforms. In our work, we pre… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 14 pages, 10 figures, 4 tables, IEEE Transactions on Circuits and Systems for Artificial Intelligence

    ACM Class: C.4; C.3; I.2

  9. arXiv:2405.14599  [pdf, other

    cs.CV cs.LG

    Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields

    Authors: Tom Fischer, Pascal Peter, Joachim Weickert, Eddy Ilg

    Abstract: Deep learning has revolutionized the field of computer vision by introducing large scale neural networks with millions of parameters. Training these networks requires massive datasets and leads to intransparent models that can fail to generalize. At the other extreme, models designed from partial differential equations (PDEs) embed specialized domain knowledge into mathematical equations and usual… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  10. arXiv:2404.16044  [pdf, other

    cs.HC cs.GR

    Toward the Categorical Data Map

    Authors: Frederik L. Dennig, Lucas Joos, Patrick Paetzold, Daniela Blumberg, Oliver Deussen, Daniel A. Keim, Maximilian T. Fischer

    Abstract: Categorical data does not have an intrinsic definition of distance or order, and therefore, established visualization techniques for categorical data only allow for a set-based or frequency-based analysis, e.g., through Euler diagrams or Parallel Sets, and do not support a similarity-based analysis. We present a novel dimensionality reduction-based visualization for categorical data, which is base… ▽ More

    Submitted 14 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 12 pages, 10 figures, LaTeX; formatting; corrected typo

  11. arXiv:2404.09406  [pdf, other

    cs.CV cs.HC cs.LG cs.RO

    Human-in-the-Loop Segmentation of Multi-species Coral Imagery

    Authors: Scarlett Raine, Ross Marchant, Brano Kusy, Frederic Maire, Niko Suenderhauf, Tobias Fischer

    Abstract: Broad-scale marine surveys performed by underwater vehicles significantly increase the availability of coral reef imagery, however it is costly and time-consuming for domain experts to label images. Point label propagation is an approach used to leverage existing image data labeled with sparse point labels. The resulting augmented ground truth generated is then used to train a semantic segmentatio… ▽ More

    Submitted 16 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: Accepted at the CVPR2024 3rd Workshop on Learning with Limited Labelled Data for Image and Video Understanding (L3D-IVU), 10 pages, 6 figures, an additional 4 pages of supplementary material

  12. arXiv:2404.03658  [pdf, other

    cs.CV

    Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

    Authors: Rui Li, Tobias Fischer, Mattia Segu, Marc Pollefeys, Luc Van Gool, Federico Tombari

    Abstract: Recovering the 3D scene geometry from a single view is a fundamental yet ill-posed problem in computer vision. While classical depth estimation methods infer only a 2.5D scene representation limited to the image plane, recent approaches based on radiance fields reconstruct a full 3D representation. However, these methods still struggle with occluded regions since inferring geometry without visual… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project page: https://ruili3.github.io/kyn

  13. arXiv:2404.03073  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian

    Authors: Kaavya Chaparala, Guido Zarrella, Bruce Torres Fischer, Larry Kimura, Oiwi Parker Jones

    Abstract: In this paper we address the challenge of improving Automatic Speech Recognition (ASR) for a low-resource language, Hawaiian, by incorporating large amounts of independent text data into an ASR foundation model, Whisper. To do this, we train an external language model (LM) on ~1.5M words of Hawaiian text. We then use the LM to rescore Whisper and compute word error rates (WERs) on a manually curat… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  14. arXiv:2404.00168  [pdf, other

    cs.CV

    Multi-Level Neural Scene Graphs for Dynamic Urban Environments

    Authors: Tobias Fischer, Lorenzo Porzi, Samuel Rota Bulò, Marc Pollefeys, Peter Kontschieder

    Abstract: We estimate the radiance field of large-scale dynamic areas from multiple vehicle captures under varying environmental conditions. Previous works in this domain are either restricted to static environments, do not scale to more than a single short video, or struggle to separately represent dynamic object instances. To this end, we present a novel, decomposable radiance field approach for dynamic u… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project page is available at https://tobiasfshr.github.io/pub/ml-nsg/

  15. arXiv:2403.16425  [pdf, other

    cs.RO cs.CV

    Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras

    Authors: Gokul B. Nair, Michael Milford, Tobias Fischer

    Abstract: Event cameras are increasingly popular in robotics due to their beneficial features, such as low latency, energy efficiency, and high dynamic range. Nevertheless, their downstream task performance is greatly influenced by the optimization of bias parameters. These parameters, for instance, regulate the necessary change in light intensity to trigger an event, which in turn depends on factors such a… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 8 pages, 9 figures, paper under review

  16. arXiv:2403.15313  [pdf, other

    cs.CV cs.AI

    CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking

    Authors: Nicolas Baumann, Michael Baumgartner, Edoardo Ghignone, Jonas Kühne, Tobias Fischer, Yung-Hsu Yang, Marc Pollefeys, Michele Magno

    Abstract: Accurate detection and tracking of surrounding objects is essential to enable self-driving vehicles. While Light Detection and Ranging (LiDAR) sensors have set the benchmark for high performance, the appeal of camera-only solutions lies in their cost-effectiveness. Notably, despite the prevalent use of Radio Detection and Ranging (RADAR) sensors in automotive systems, their potential in 3D detecti… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  17. arXiv:2401.01955  [pdf, other

    cs.HC cs.MM

    MULTI-CASE: A Transformer-based Ethics-aware Multimodal Investigative Intelligence Framework

    Authors: Maximilian T. Fischer, Yannick Metz, Lucas Joos, Matthias Miller, Daniel A. Keim

    Abstract: AI-driven models are increasingly deployed in operational analytics solutions, for instance, in investigative journalism or the intelligence community. Current approaches face two primary challenges: ethical and privacy concerns, as well as difficulties in efficiently combining heterogeneous data sources for multimodal analytics. To tackle the challenge of multimodal analytics, we present MULTI-CA… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 6 pages, 3 figures, 1 table

  18. arXiv:2311.14276  [pdf, other

    cs.RO cs.CV

    Racing With ROS 2 A Navigation System for an Autonomous Formula Student Race Car

    Authors: Alastair Bradford, Grant van Breda, Tobias Fischer

    Abstract: The advent of autonomous vehicle technologies has significantly impacted various sectors, including motorsport, where Formula Student and Formula: Society of Automotive Engineers introduced autonomous racing classes. These offer new challenges to aspiring engineers, including the team at QUT Motorsport, but also raise the entry barrier due to the complexity of high-speed navigation and control. Th… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 10 pages, 6 figures

    Journal ref: Australasian Conference on Robotics and Automation (ACRA 2023)

  19. arXiv:2311.13186  [pdf, other

    cs.CV cs.RO

    Applications of Spiking Neural Networks in Visual Place Recognition

    Authors: Somayeh Hussaini, Michael Milford, Tobias Fischer

    Abstract: In robotics, Spiking Neural Networks (SNNs) are increasingly recognized for their largely-unrealized potential energy efficiency and low latency particularly when implemented on neuromorphic hardware. Our paper highlights three advancements for SNNs in Visual Place Recognition (VPR). First, we propose Modular SNNs, where each SNN represents a set of non-overlapping geographically distinct places,… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 17 pages, 8 figures, under review

  20. arXiv:2311.02872  [pdf, other

    cs.CV

    FocusTune: Tuning Visual Localization through Focus-Guided Sampling

    Authors: Son Tung Nguyen, Alejandro Fontan, Michael Milford, Tobias Fischer

    Abstract: We propose FocusTune, a focus-guided sampling technique to improve the performance of visual localization algorithms. FocusTune directs a scene coordinate regression model towards regions critical for 3D point triangulation by exploiting key geometric constraints. Specifically, rather than uniformly sampling points across the image for training the scene coordinate regression model, we instead re-… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  21. arXiv:2309.15405  [pdf

    cs.RO eess.SY

    Teach and Repeat Navigation: A Robust Control Approach

    Authors: Payam Nourizadeh, Michael Milford, Tobias Fischer

    Abstract: Robot navigation requires an autonomy pipeline that is robust to environmental changes and effective in varying conditions. Teach and Repeat (T&R) navigation has shown high performance in autonomous repeated tasks under challenging circumstances, but research within T&R has predominantly focused on motion planning as opposed to motion control. In this paper, we propose a novel T&R system based on… ▽ More

    Submitted 29 May, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE International Conference on Robotics and Automation 2024 (ICRA2024)

  22. arXiv:2309.10225  [pdf, other

    cs.RO

    VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition

    Authors: Adam D. Hines, Peter G. Stratton, Michael Milford, Tobias Fischer

    Abstract: Spiking Neural Networks (SNNs) are at the forefront of neuromorphic computing thanks to their potential energy-efficiency, low latencies, and capacity for continual learning. While these capabilities are well suited for robotics tasks, SNNs have seen limited adaptation in this field thus far. This work introduces a SNN for Visual Place Recognition (VPR) that is both trainable within minutes and qu… ▽ More

    Submitted 29 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 8 pages, 3 figures, accepted to the IEEE International Conference on Robotics and Automation (ICRA) 2024

  23. arXiv:2308.14713  [pdf, other

    cs.CV

    R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras

    Authors: Aron Schmied, Tobias Fischer, Martin Danelljan, Marc Pollefeys, Fisher Yu

    Abstract: Dense 3D reconstruction and ego-motion estimation are key challenges in autonomous driving and robotics. Compared to the complex, multi-modal systems deployed today, multi-camera systems provide a simpler, low-cost alternative. However, camera-based 3D reconstruction of complex dynamic scenes has proven extremely difficult, as existing solutions often produce incomplete or incoherent results. We p… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023. Project page is available at https://www.vis.xyz/pub/r3d3/

  24. arXiv:2308.00257  [pdf, other

    cs.RO

    Trajectory Tracking via Multiscale Continuous Attractor Networks

    Authors: Therese Joseph, Tobias Fischer, Michael Milford

    Abstract: Animals and insects showcase remarkably robust and adept navigational abilities, up to literally circumnavigating the globe. Primary progress in robotics inspired by these natural systems has occurred in two areas: highly theoretical computational neuroscience models, and handcrafted systems like RatSLAM and NeuroSLAM. In this research, we present work bridging the gap between the two, in the form… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

    Comments: 8 Pages, 8 Figures, accepted at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

  25. arXiv:2307.03493  [pdf, other

    cs.AR cs.LG

    ITA: An Energy-Efficient Attention and Softmax Accelerator for Quantized Transformers

    Authors: Gamze İslamoğlu, Moritz Scherer, Gianna Paulin, Tim Fischer, Victor J. B. Jung, Angelo Garofalo, Luca Benini

    Abstract: Transformer networks have emerged as the state-of-the-art approach for natural language processing tasks and are gaining popularity in other domains such as computer vision and audio processing. However, the efficient hardware acceleration of transformer models poses new challenges due to their high arithmetic intensities, large memory requirements, and complex dataflow dependencies. In this work,… ▽ More

    Submitted 10 July, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: Accepted for publication at the 2023 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED)

  26. FlooNoC: A Multi-Tbps Wide NoC for Heterogeneous AXI4 Traffic

    Authors: Tim Fischer, Michael Rogenmoser, Matheus Cavalcante, Frank K. Gürkaynak, Luca Benini

    Abstract: Meeting the staggering bandwidth requirements of today's applications challenges the traditional narrow and serialized NoCs, which hit hard bounds on the maximum operating frequency. This paper proposes FlooNoC, an open-source, low-latency, fully AXI4-compatible NoC with wide physical channels for latency-tolerant high-bandwidth non-blocking transactions and decoupled latency-critical short messag… ▽ More

    Submitted 6 August, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

  27. arXiv:2304.08408  [pdf, other

    cs.CV

    OVTrack: Open-Vocabulary Multiple Object Tracking

    Authors: Siyuan Li, Tobias Fischer, Lei Ke, Henghui Ding, Martin Danelljan, Fisher Yu

    Abstract: The ability to recognize, localize and track dynamic objects in a scene is fundamental to many real-world applications, such as self-driving and robotic systems. Yet, traditional multiple object tracking (MOT) benchmarks rely only on a few object categories that hardly represent the multitude of possible objects that are encountered in the real world. This leaves contemporary MOT methods limited t… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  28. arXiv:2304.04640  [pdf, other

    cs.AI

    NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems

    Authors: Jason Yik, Korneel Van den Berghe, Douwe den Blanken, Younes Bouhadjar, Maxime Fabre, Paul Hueber, Denis Kleyko, Noah Pacik-Nelson, Pao-Sheng Vincent Sun, Guangzhi Tang, Shenqi Wang, Biyan Zhou, Soikat Hasan Ahmed, George Vathakkattil Joseph, Benedetto Leto, Aurora Micheli, Anurag Kumar Mishra, Gregor Lenz, Tao Sun, Zergham Ahmed, Mahmoud Akl, Brian Anderson, Andreas G. Andreou, Chiara Bartolozzi, Arindam Basu , et al. (73 additional authors not shown)

    Abstract: Neuromorphic computing shows promise for advancing computing efficiency and capabilities of AI applications using brain-inspired principles. However, the neuromorphic research field currently lacks standardized benchmarks, making it difficult to accurately measure technological advancements, compare performance with conventional methods, and identify promising future research directions. Prior neu… ▽ More

    Submitted 17 January, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Updated from whitepaper to full perspective article preprint

  29. Visual Place Recognition: A Tutorial

    Authors: Stefan Schubert, Peer Neubert, Sourav Garg, Michael Milford, Tobias Fischer

    Abstract: Localization is an essential capability for mobile robots. A rapidly growing field of research in this area is Visual Place Recognition (VPR), which is the ability to recognize previously seen places in the world based solely on images. This present work is the first tutorial paper on visual place recognition. It unifies the terminology of VPR and complements prior research in two important direct… ▽ More

    Submitted 9 August, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: IEEE Robotics & Automation Magazine (RAM)

  30. arXiv:2303.00973  [pdf, other

    cs.CV cs.LG cs.RO

    Image Labels Are All You Need for Coarse Seagrass Segmentation

    Authors: Scarlett Raine, Ross Marchant, Brano Kusy, Frederic Maire, Tobias Fischer

    Abstract: Seagrass meadows serve as critical carbon sinks, but estimating the amount of carbon they store requires knowledge of the seagrass species present. Underwater and surface vehicles equipped with machine learning algorithms can help to accurately estimate the composition and extent of seagrass meadows at scale. However, previous approaches for seagrass detection and classification have required supe… ▽ More

    Submitted 5 September, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: 10 pages, 4 figures, additional 3 pages of supplementary material

    Journal ref: 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

  31. arXiv:2212.01247  [pdf, other

    cs.CV

    CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion

    Authors: Tobias Fischer, Yung-Hsu Yang, Suryansh Kumar, Min Sun, Fisher Yu

    Abstract: To track the 3D locations and trajectories of the other traffic participants at any given time, modern autonomous vehicles are equipped with multiple cameras that cover the vehicle's full surroundings. Yet, camera-based 3D object tracking methods prioritize optimizing the single-camera setup and resort to post-hoc fusion in a multi-camera setup. In this paper, we propose a method for panoramic 3D… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: Project page: https://www.vis.xyz/pub/cc-3dt/

  32. arXiv:2212.00688  [pdf, other

    cs.AR

    TCN-CUTIE: A 1036 TOp/s/W, 2.72 uJ/Inference, 12.2 mW All-Digital Ternary Accelerator in 22 nm FDX Technology

    Authors: Moritz Scherer, Alfio Di Mauro, Tim Fischer, Georg Rutishauser, Luca Benini

    Abstract: Tiny Machine Learning (TinyML) applications impose uJ/Inference constraints, with a maximum power consumption of tens of mW. It is extremely challenging to meet these requirements at a reasonable accuracy level. This work addresses the challenge with a flexible, fully digital Ternary Neural Network (TNN) accelerator in a RISC-V-based System-on-Chip (SoC). Besides supporting Ternary Convolutional N… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: Accepted at IEEE MICRO Journal

  33. arXiv:2211.13989  [pdf, other

    cs.AR cs.DC cs.NI

    HexaMesh: Scaling to Hundreds of Chiplets with an Optimized Chiplet Arrangement

    Authors: Patrick Iff, Maciej Besta, Matheus Cavalcante, Tim Fischer, Luca Benini, Torsten Hoefler

    Abstract: 2.5D integration is an important technique to tackle the growing cost of manufacturing chips in advanced technology nodes. This poses the challenge of providing high-performance inter-chiplet interconnects (ICIs). As the number of chiplets grows to tens or hundreds, it becomes infeasible to hand-optimize their arrangement in a way that maximizes the ICI performance. In this paper, we propose HexaM… ▽ More

    Submitted 8 October, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

  34. arXiv:2211.13980  [pdf, other

    cs.AR cs.DC cs.NI

    Sparse Hamming Graph: A Customizable Network-on-Chip Topology

    Authors: Patrick Iff, Maciej Besta, Matheus Cavalcante, Tim Fischer, Luca Benini, Torsten Hoefler

    Abstract: Chips with hundreds to thousands of cores require scalable networks-on-chip (NoCs). Customization of the NoC topology is necessary to reach the diverse design goals of different chips. We introduce sparse Hamming graph, a novel NoC topology with an adjustable costperformance trade-off that is based on four NoC topology design principles we identified. To efficiently customize this topology, we dev… ▽ More

    Submitted 28 June, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

  35. Boosting Performance of a Baseline Visual Place Recognition Technique by Predicting the Maximally Complementary Technique

    Authors: Connor Malone, Stephen Hausler, Tobias Fischer, Michael Milford

    Abstract: One recent promising approach to the Visual Place Recognition (VPR) problem has been to fuse the place recognition estimates of multiple complementary VPR techniques using methods such as SRAL and multi-process fusion. These approaches come with a substantial practical limitation: they require all potential VPR methods to be brute-force run before they are selectively fused. The obvious solution t… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 7 pages, 5 figures. arXiv admin note: text overlap with arXiv:2112.04701

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  36. arXiv:2210.06984  [pdf, other

    cs.CV

    QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking

    Authors: Tobias Fischer, Thomas E. Huang, Jiangmiao Pang, Linlu Qiu, Haofeng Chen, Trevor Darrell, Fisher Yu

    Abstract: Similarity learning has been recognized as a crucial step for object tracking. However, existing multiple object tracking methods only use sparse ground truth matching as the training objective, while ignoring the majority of the informative regions in images. In this paper, we present Quasi-Dense Similarity Learning, which densely samples hundreds of object regions on a pair of images for contras… ▽ More

    Submitted 27 September, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  37. arXiv:2209.08723  [pdf, other

    cs.CV

    Ensembles of Compact, Region-specific & Regularized Spiking Neural Networks for Scalable Place Recognition

    Authors: Somayeh Hussaini, Michael Milford, Tobias Fischer

    Abstract: Spiking neural networks have significant potential utility in robotics due to their high energy efficiency on specialized hardware, but proof-of-concept implementations have not yet typically achieved competitive performance or capability with conventional approaches. In this paper, we tackle one of the key practical challenges of scalability by introducing a novel modular ensemble network approac… ▽ More

    Submitted 5 May, 2023; v1 submitted 18 September, 2022; originally announced September 2022.

    Comments: 8 pages, 6 figures, accepted to the IEEE International Conference on Robotics and Automation (ICRA) 2023

  38. arXiv:2208.13930  [pdf, other

    cs.CV

    SAFE: Sensitivity-Aware Features for Out-of-Distribution Object Detection

    Authors: Samuel Wilson, Tobias Fischer, Feras Dayoub, Dimity Miller, Niko Sünderhauf

    Abstract: We address the problem of out-of-distribution (OOD) detection for the task of object detection. We show that residual convolutional layers with batch normalisation produce Sensitivity-Aware FEatures (SAFE) that are consistently powerful for distinguishing in-distribution from out-of-distribution detections. We extract SAFE vectors for every detected object, and train a multilayer perceptron on the… ▽ More

    Submitted 22 August, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

    Journal ref: IEEE International Conference on Computer Vision 2023

  39. arXiv:2207.03192  [pdf, other

    cs.AR

    MiniFloat-NN and ExSdotp: An ISA Extension and a Modular Open Hardware Unit for Low-Precision Training on RISC-V cores

    Authors: Luca Bertaccini, Gianna Paulin, Tim Fischer, Stefan Mach, Luca Benini

    Abstract: Low-precision formats have recently driven major breakthroughs in neural network (NN) training and inference by reducing the memory footprint of the NN models and improving the energy efficiency of the underlying hardware architectures. Narrow integer data types have been vastly investigated for NN inference and have successfully been pushed to the extreme of ternary and binary representations. In… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: This work has been submitted to the ARITH22 - IEEE Symposium on Computer Arithmetic for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. 8 pages

  40. arXiv:2206.13673  [pdf, other

    cs.CV cs.AI cs.RO

    How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels

    Authors: Tobias Fischer, Michael Milford

    Abstract: Event cameras continue to attract interest due to desirable characteristics such as high dynamic range, low latency, virtually no motion blur, and high energy efficiency. One of the potential applications that would benefit from these characteristics lies in visual place recognition for robot localization, i.e. matching a query observation to the corresponding reference place in the database. In t… ▽ More

    Submitted 13 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: 8 pages

    Journal ref: IEEE Robotics and Automation Letters 2022

  41. Promoting Ethical Awareness in Communication Analysis: Investigating Potentials and Limits of Visual Analytics for Intelligence Applications

    Authors: Maximilian T. Fischer, Simon David Hirsbrunner, Wolfgang Jentner, Matthias Miller, Daniel A. Keim, Paula Helm

    Abstract: Digital systems for analyzing human communication data have become prevalent in recent years. Intelligence analysis of communications data in investigative journalism, criminal intelligence, and law present particularly interesting cases, as they must take into account the often highly sensitive properties of the underlying operations and data. At the same time, these are areas where increasingly… ▽ More

    Submitted 2 May, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: 13 pages, 4 figures

    Journal ref: Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22), June 21--24, 2022, Seoul, Republic of Korea

  42. arXiv:2202.13487  [pdf, other

    cs.CV cs.LG cs.RO

    Point Label Aware Superpixels for Multi-species Segmentation of Underwater Imagery

    Authors: Scarlett Raine, Ross Marchant, Brano Kusy, Frederic Maire, Tobias Fischer

    Abstract: Monitoring coral reefs using underwater vehicles increases the range of marine surveys and availability of historical ecological data by collecting significant quantities of images. Analysis of this imagery can be automated using a model trained to perform semantic segmentation, however it is too costly and time-consuming to densely label images for training supervised models. In this letter, we l… ▽ More

    Submitted 10 July, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Journal ref: IEEE Robotics and Automation Letters 2022, vol. 7, no. 3, pp. 8291-8298

  43. arXiv:2112.05341  [pdf, other

    cs.CV cs.AI

    Hyperdimensional Feature Fusion for Out-Of-Distribution Detection

    Authors: Samuel Wilson, Tobias Fischer, Niko Sünderhauf, Feras Dayoub

    Abstract: We introduce powerful ideas from Hyperdimensional Computing into the challenging field of Out-of-Distribution (OOD) detection. In contrast to most existing work that performs OOD detection based on only a single layer of a neural network, we use similarity-preserving semi-orthogonal projection matrices to project the feature maps from multiple layers into a common vector space. By repeatedly apply… ▽ More

    Submitted 29 August, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: Accepted to WACV2023

  44. arXiv:2112.04701  [pdf, other

    cs.CV

    Unsupervised Complementary-aware Multi-process Fusion for Visual Place Recognition

    Authors: Stephen Hausler, Tobias Fischer, Michael Milford

    Abstract: A recent approach to the Visual Place Recognition (VPR) problem has been to fuse the place recognition estimates of multiple complementary VPR techniques simultaneously. However, selecting the optimal set of techniques to use in a specific deployment environment a-priori is a difficult and unresolved challenge. Further, to the best of our knowledge, no method exists which can select a set of techn… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

  45. arXiv:2110.10756  [pdf, ps, other

    eess.SP cs.IT math.OC

    Ambiguities in Direction-of-Arrival Estimation with Linear Arrays

    Authors: Frederic Matter, Tobias Fischer, Marius Pesavento, Marc E. Pfetsch

    Abstract: In this paper, we present a novel approach to compute ambiguities in thinned uniform linear arrays, i.e., sparse non-uniform linear arrays, via a mixed-integer program. Ambiguities arise when there exists a set of distinct directions-of-arrival, for which the corresponding steering matrix is rank-deficient and are associated with nonunique parameter estimation. Our approach uses Young tableaux for… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

  46. Spiking Neural Networks for Visual Place Recognition via Weighted Neuronal Assignments

    Authors: Somayeh Hussaini, Michael Milford, Tobias Fischer

    Abstract: Spiking neural networks (SNNs) offer both compelling potential advantages, including energy efficiency and low latencies and challenges including the non-differentiable nature of event spikes. Much of the initial research in this area has converted deep neural networks to equivalent SNNs, but this conversion approach potentially negates some of the advantages of SNN-based approaches developed from… ▽ More

    Submitted 9 February, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: 8 pages, 6 figures, IEEE Robotics and Automation Letters (RA-L), also accepted to IEEE International Conference on Robotics and Automation (ICRA 2022)

    Journal ref: IEEE Robotics and Automation Letters 2022

  47. arXiv:2109.00097  [pdf, other

    cs.RO cs.CV

    Bio-inspired robot perception coupled with robot-modeled human perception

    Authors: Tobias Fischer

    Abstract: My overarching research goal is to provide robots with perceptional abilities that allow interactions with humans in a human-like manner. To develop these perceptional abilities, I believe that it is useful to study the principles of the human visual system. I use these principles to develop new computer vision algorithms and validate their effectiveness in intelligent robotic systems. I am enthus… ▽ More

    Submitted 31 August, 2021; originally announced September 2021.

    Comments: Paper accepted to the "Robotics: Science and Systems Pioneers Workshop 2021"

  48. Towards a Survey on Static and Dynamic Hypergraph Visualizations

    Authors: Maximilian T. Fischer, Alexander Frings, Daniel A. Keim, Daniel Seebacher

    Abstract: Leveraging hypergraph structures to model advanced processes has gained much attention over the last few years in many areas, ranging from protein-interaction in computational biology to image retrieval using machine learning. Hypergraph models can provide a more accurate representation of the underlying processes while reducing the overall number of links compared to regular representations. Howe… ▽ More

    Submitted 29 July, 2021; originally announced July 2021.

    Comments: 2021 IEEE Visualization Conference (VIS)

    Journal ref: 2021 IEEE Visualization Conference (VIS)

  49. Probabilistic Appearance-Invariant Topometric Localization with New Place Awareness

    Authors: Ming Xu, Tobias Fischer, Niko Sünderhauf, Michael Milford

    Abstract: Probabilistic state-estimation approaches offer a principled foundation for designing localization systems, because they naturally integrate sequences of imperfect motion and exteroceptive sensor data. Recently, probabilistic localization systems utilizing appearance-invariant visual place recognition (VPR) methods as the primary exteroceptive sensor have demonstrated state-of-the-art performance… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 8 pages

    Journal ref: IEEE Robotics and Automation Letters and IROS 2021

  50. Communication Analysis through Visual Analytics: Current Practices, Challenges, and New Frontiers

    Authors: Maximilian T. Fischer, Frederik L. Dennig, Daniel Seebacher, Daniel A. Keim, Mennatallah El-Assady

    Abstract: The automated analysis of digital human communication data often focuses on specific aspects such as content or network structure in isolation. This can provide limited perspectives while making cross-methodological analyses, occurring in domains like investigative journalism, difficult. Communication research in psychology and the digital humanities instead stresses the importance of a holistic a… ▽ More

    Submitted 6 July, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: 11 pages, 2 tables, 1 figure

    Journal ref: 2022 IEEE Visualization in Data Science (VDS)