Skip to main content

Showing 1–50 of 383 results for author: Navab, N

  1. arXiv:2407.07015  [pdf, other

    cs.HC cs.MM cs.SD eess.AS

    A Framework for Multimodal Medical Image Interaction

    Authors: Laura Schütz, Sasan Matinfar, Gideon Schafroth, Navid Navab, Merle Fairhurst, Arthur Wagner, Benedikt Wiestler, Ulrich Eck, Nassir Navab

    Abstract: Medical doctors rely on images of the human anatomy, such as magnetic resonance imaging (MRI), to localize regions of interest in the patient during diagnosis and treatment. Despite advances in medical imaging technology, the information conveyance remains unimodal. This visual representation fails to capture the complexity of the real, multisensory interaction with human tissue. However, perceivi… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted for publication in IEEE TVCG; presentation at IEEE ISMAR 2024

    ACM Class: H.5.2; H.5.5; H.5.1; J.3

  2. arXiv:2407.05428  [pdf, other

    eess.IV cs.CV

    Diffusion as Sound Propagation: Physics-inspired Model for Ultrasound Image Generation

    Authors: Marina Domínguez, Yordanka Velikova, Nassir Navab, Mohammad Farid Azampour

    Abstract: Deep learning (DL) methods typically require large datasets to effectively learn data distributions. However, in the medical field, data is often limited in quantity, and acquiring labeled data can be costly. To mitigate this data scarcity, data augmentation techniques are commonly employed. Among these techniques, generative models play a pivotal role in expanding datasets. However, when it comes… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  3. arXiv:2406.09801  [pdf, other

    cs.CV

    RaNeuS: Ray-adaptive Neural Surface Reconstruction

    Authors: Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari

    Abstract: Our objective is to leverage a differentiable radiance field \eg NeRF to reconstruct detailed 3D surfaces in addition to producing the standard novel view renderings. There have been related methods that perform such tasks, usually by utilizing a signed distance field (SDF). However, the state-of-the-art approaches still fail to correctly reconstruct the small-scale details, such as the leaves, ro… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 3DV 2024, oral. In: Proceedings of the IEEE/CVF International Conference on 3D Vision (2023)

  4. arXiv:2406.04100  [pdf, other

    cs.CV cs.RO

    Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging

    Authors: Zhongliang Jiang, Yunfeng Kang, Yuan Bi, Xuesong Li, Chenyang Li, Nassir Navab

    Abstract: Ultrasound imaging has been widely used in clinical examinations owing to the advantages of being portable, real-time, and radiation-free. Considering the potential of extensive deployment of autonomous examination systems in hospitals, robotic US imaging has attracted increased attention. However, due to the inter-patient variations, it is still challenging to have an optimal path for each patien… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2406.00644  [pdf, other

    cs.CV

    Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance

    Authors: Jun Li, Tongkun Su, Baoliang Zhao, Faqin Lv, Qiong Wang, Nassir Navab, Ying Hu, Zhongliang Jiang

    Abstract: Automatic report generation has arisen as a significant research area in computer-aided diagnosis, aiming to alleviate the burden on clinicians by generating reports automatically based on medical images. In this work, we propose a novel framework for automatic ultrasound report generation, leveraging a combination of unsupervised and supervised learning methods to aid the report generation proces… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  6. arXiv:2405.17606  [pdf, other

    cs.RO

    A Patient-Specific Framework for Autonomous Spinal Fixation via a Steerable Drilling Robot

    Authors: Susheela Sharma, Sarah Go, Zeynep Yakay, Yash Kulkarni, Siddhartha Kapuria, Jordan P. Amadio, Mohsen Khadem, Nassir Navab, Farshid Alambeigi

    Abstract: In this paper, with the goal of enhancing the minimally invasive spinal fixation procedure in osteoporotic patients, we propose a first-of-its-kind image-guided robotic framework for performing an autonomous and patient-specific procedure using a unique concentric tube steerable drilling robot (CT-SDR). Particularly, leveraging a CT-SDR, we introduce the concept of J-shape drilling based on a pre-… ▽ More

    Submitted 5 July, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: 10 pages, 3 figures. This paper has been accepted for publication at the 2024 International Conference on Medical Image Computing and Computer Assisted Interventions

  7. arXiv:2405.10075  [pdf, other

    cs.CV cs.AI

    HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition

    Authors: Kun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas Padoy

    Abstract: Natural language could play an important role in developing generalist surgical models by providing a broad source of supervision from raw texts. This flexible form of supervision can enable the model's transferability across datasets and tasks as natural language can be used to reference learned visual concepts or describe new ones. In this work, we present HecVL, a novel hierarchical video-langu… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted by MICCAI2024

  8. arXiv:2405.00915  [pdf, other

    cs.CV cs.AI cs.LG

    EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

    Authors: Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen, Ruotong Liao, Yan Di, Nassir Navab, Federico Tombari, Benjamin Busam

    Abstract: We present EchoScene, an interactive and controllable generative model that generates 3D indoor scenes on scene graphs. EchoScene leverages a dual-branch diffusion model that dynamically adapts to scene graphs. Existing methods struggle to handle scene graphs due to varying numbers of nodes, multiple edge combinations, and manipulator-induced node-edge operations. EchoScene overcomes this by assoc… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 25 pages. 10 figures

  9. arXiv:2404.19481  [pdf, other

    eess.IV cs.CV

    SpecstatOR: Speckle statistics-based iOCT Segmentation Network for Ophthalmic Surgery

    Authors: Kristina Mach, Hessam Roodaki, Michael Sommersperger, Nassir Navab

    Abstract: This paper presents an innovative approach to intraoperative Optical Coherence Tomography (iOCT) image segmentation in ophthalmic surgery, leveraging statistical analysis of speckle patterns to incorporate statistical pathology-specific prior knowledge. Our findings indicate statistically different speckle patterns within the retina and between retinal layers and surgical tools, facilitating the s… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  10. arXiv:2404.09927  [pdf, other

    cs.RO cs.LG

    Autonomous Path Planning for Intercostal Robotic Ultrasound Imaging Using Reinforcement Learning

    Authors: Yuan Bi, Cheng Qian, Zhicheng Zhang, Nassir Navab, Zhongliang Jiang

    Abstract: Ultrasound (US) has been widely used in daily clinical practice for screening internal organs and guiding interventions. However, due to the acoustic shadow cast by the subcutaneous rib cage, the US examination for thoracic application is still challenging. To fully cover and reconstruct the region of interest in US for diagnosis, an intercostal scanning path is necessary. To tackle this challenge… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  11. arXiv:2404.08805  [pdf, other

    eess.IV cs.CV cs.LG

    Real-time guidewire tracking and segmentation in intraoperative x-ray

    Authors: Baochang Zhang, Mai Bui, Cheng Wang, Felix Bourier, Heribert Schunkert, Nassir Navab

    Abstract: During endovascular interventions, physicians have to perform accurate and immediate operations based on the available real-time information, such as the shape and position of guidewires observed on the fluoroscopic images, haptic information and the patients' physiological signals. For this purpose, real-time and accurate guidewire segmentation and tracking can enhance the visualization of guidew… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  12. arXiv:2404.07668  [pdf, other

    eess.IV cs.CV

    Shape Completion in the Dark: Completing Vertebrae Morphology from 3D Ultrasound

    Authors: Miruna-Alexandra Gafencu, Yordanka Velikova, Mahdi Saleh, Tamas Ungi, Nassir Navab, Thomas Wendler, Mohammad Farid Azampour

    Abstract: Purpose: Ultrasound (US) imaging, while advantageous for its radiation-free nature, is challenging to interpret due to only partially visible organs and a lack of complete 3D information. While performing US-based diagnosis or investigation, medical professionals therefore create a mental map of the 3D anatomy. In this work, we aim to replicate this process and enhance the visual representation of… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  13. arXiv:2404.07031  [pdf, other

    cs.CV

    ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling

    Authors: Ege Özsoy, Chantal Pellegrini, Matthias Keicher, Nassir Navab

    Abstract: Every day, countless surgeries are performed worldwide, each within the distinct settings of operating rooms (ORs) that vary not only in their setups but also in the personnel, tools, and equipment used. This inherent diversity poses a substantial challenge for achieving a holistic understanding of the OR, as it requires models to generalize beyond their initial training datasets. To reduce this g… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 11 pages, 3 figures, 7 tables

  14. arXiv:2404.05584  [pdf, other

    cs.CV eess.IV

    Neural Cellular Automata for Lightweight, Robust and Explainable Classification of White Blood Cell Images

    Authors: Michael Deutges, Ario Sadafi, Nassir Navab, Carsten Marr

    Abstract: Diagnosis of hematological malignancies depends on accurate identification of white blood cells in peripheral blood smears. Deep learning techniques are emerging as a viable solution to scale and optimize this process by automatic identification of cells in laboratories. However, these techniques face several challenges such as limited generalizability, sensitivity to domain shifts and lack of exp… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  15. arXiv:2403.14523  [pdf, other

    eess.IV cs.CV

    Invisible Needle Detection in Ultrasound: Leveraging Mechanism-Induced Vibration

    Authors: Chenyang Li, Dianye Huang, Angelos Karlas, Nassir Navab, Zhongliang Jiang

    Abstract: In clinical applications that involve ultrasound-guided intervention, the visibility of the needle can be severely impeded due to steep insertion and strong distractors such as speckle noise and anatomical occlusion. To address this challenge, we propose VibNet, a learning-based framework tailored to enhance the robustness and accuracy of needle detection in ultrasound images, even when the target… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  16. arXiv:2403.14465  [pdf, other

    eess.IV cs.CV

    CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers

    Authors: Alex Ranne, Liming Kuang, Yordanka Velikova, Nassir Navab, Ferdinando Rodriguez y Baena

    Abstract: In minimally invasive endovascular procedures, contrast-enhanced angiography remains the most robust imaging technique. However, it is at the expense of the patient and clinician's health due to prolonged radiation exposure. As an alternative, interventional ultrasound has notable benefits such as being radiation-free, fast to deploy, and having a small footprint in the operating room. Yet, ultras… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  17. arXiv:2403.12198  [pdf, other

    cs.CV cs.GR cs.LG

    FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos

    Authors: Florian Philipp Stilz, Mert Asim Karaoglu, Felix Tristram, Nassir Navab, Benjamin Busam, Alexander Ladikos

    Abstract: Reconstruction of endoscopic scenes is an important asset for various medical applications, from post-surgery analysis to educational training. Neural rendering has recently shown promising results in endoscopic reconstruction with deforming tissue. However, the setup has been restricted to a static endoscope, limited deformation, or required an external tracking device to retrieve camera pose inf… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  18. arXiv:2403.01517  [pdf, other

    cs.CV

    MatchU: Matching Unseen Objects for 6D Pose Estimation from RGB-D Images

    Authors: Junwen Huang, Hao Yu, Kuan-Ting Yu, Nassir Navab, Slobodan Ilic, Benjamin Busam

    Abstract: Recent learning methods for object pose estimation require resource-intensive training for each individual object instance or category, hampering their scalability in real applications when confronted with previously unseen objects. In this paper, we propose MatchU, a Fuse-Describe-Match strategy for 6D pose estimation from RGB-D images. MatchU is a generic approach that fuses 2D texture and 3D ge… ▽ More

    Submitted 8 May, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  19. arXiv:2402.03466  [pdf, other

    cs.CV cs.CG cs.RO

    Physics-Encoded Graph Neural Networks for Deformation Prediction under Contact

    Authors: Mahdi Saleh, Michael Sommersperger, Nassir Navab, Federico Tombari

    Abstract: In robotics, it's crucial to understand object deformation during tactile interactions. A precise understanding of deformation can elevate robotic simulations and have broad implications across different industries. We introduce a method using Physics-Encoded Graph Neural Networks (GNNs) for such predictions. Similar to robotic grasping and manipulation scenarios, we focus on modeling the dynamics… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted at 2024 IEEE International Conference on Robotics and Automation (ICRA2024)

  20. arXiv:2401.02539  [pdf, other

    cs.RO cs.CV

    Robot-Assisted Deep Venous Thrombosis Ultrasound Examination using Virtual Fixture

    Authors: Dianye Huang, Chenguang Yang, Mingchuan Zhou, Angelos Karlas, Nassir Navab, Zhongliang Jiang

    Abstract: Deep Venous Thrombosis (DVT) is a common vascular disease with blood clots inside deep veins, which may block blood flow or even cause a life-threatening pulmonary embolism. A typical exam for DVT using ultrasound (US) imaging is by pressing the target vein until its lumen is fully compressed. However, the compression exam is highly operator-dependent. To alleviate intra- and inter-variations, we… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted Paper IEEE T-ASE

  21. arXiv:2401.02376  [pdf, other

    cs.RO

    Machine Learning in Robotic Ultrasound Imaging: Challenges and Perspectives

    Authors: Yuan Bi, Zhongliang Jiang, Felix Duelmer, Dianye Huang, Nassir Navab

    Abstract: This article reviews the recent advances in intelligent robotic ultrasound (US) imaging systems. We commence by presenting the commonly employed robotic mechanisms and control techniques in robotic US imaging, along with their clinical applications. Subsequently, we focus on the deployment of machine learning techniques in the development of robotic sonographers, emphasizing crucial developments a… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted by Annual Review of Control, Robotics, and Autonomous Systems

  22. arXiv:2401.00633  [pdf, other

    cs.LG

    On Discprecncies between Perturbation Evaluations of Graph Neural Network Attributions

    Authors: Razieh Rezaei, Alireza Dizaji, Ashkan Khakzar, Anees Kazi, Nassir Navab, Daniel Rueckert

    Abstract: Neural networks are increasingly finding their way into the realm of graphs and modeling relationships between features. Concurrently graph neural network explanation approaches are being invented to uncover relationships between the nodes of the graphs. However, there is a disparity between the existing attribution methods, and it is unclear which attribution to trust. Therefore research has intr… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

  23. Interactive Shape Sonification for Tumor Localization in Breast Cancer Surgery

    Authors: Laura Schütz, Trishia El Chemaly, Emmanuelle Weber, Anh Thien Doan, Jacqueline Tsai, Christoph Leuze, Bruce Daniel, Nassir Navab

    Abstract: About 20 percent of patients undergoing breast-conserving surgery require reoperation due to cancerous tissue remaining inside the breast. Breast cancer localization systems utilize auditory feedback to convey the distance between a localization probe and a small marker (seed) implanted into the breast tumor prior to surgery. However, no information on the location of the tumor margin is provided.… ▽ More

    Submitted 28 January, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 15 pages, 9 figures

    ACM Class: H.5.2; H.5.5; J.3

    Journal ref: Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11-16, 2024, Honolulu, HI, USA. ACM, New York, NY, USA

  24. arXiv:2312.15059  [pdf, other

    cs.CV cs.AI

    Deformable 3D Gaussian Splatting for Animatable Human Avatars

    Authors: HyunJun Jung, Nikolas Brasch, Jifei Song, Eduardo Perez-Pellitero, Yiren Zhou, Zhihao Li, Nassir Navab, Benjamin Busam

    Abstract: Recent advances in neural radiance fields enable novel view synthesis of photo-realistic images in dynamic settings, which can be applied to scenarios with human animation. Commonly used implicit backbones to establish accurate models, however, require many input views and additional annotations such as human masks, UV maps and depth maps. In this work, we propose ParDy-Human (Parameterized Dynami… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  25. arXiv:2312.10251  [pdf, other

    cs.CV cs.AI

    Advancing Surgical VQA with Scene Graph Knowledge

    Authors: Kun Yuan, Manasi Kattel, Joel L. Lavanchy, Nassir Navab, Vinkle Srivastav, Nicolas Padoy

    Abstract: Modern operating room is becoming increasingly complex, requiring innovative intra-operative support systems. While the focus of surgical data science has largely been on video analysis, integrating surgical computer vision with language capabilities is emerging as a necessity. Our work aims to advance Visual Question Answering (VQA) in the surgical context with scene graph knowledge, addressing t… ▽ More

    Submitted 24 June, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: IPCAI 2024, Int J CARS (2024)

  26. arXiv:2312.03678  [pdf, other

    cs.CV

    Hybrid Functional Maps for Crease-Aware Non-Isometric Shape Matching

    Authors: Lennart Bastian, Yizheng Xie, Nassir Navab, Zorah Lähner

    Abstract: Non-isometric shape correspondence remains a fundamental challenge in computer vision. Traditional methods using Laplace-Beltrami operator (LBO) eigenmodes face limitations in characterizing high-frequency extrinsic shape changes like bending and creases. We propose a novel approach of combining the non-orthogonal extrinsic basis of eigenfunctions of the elastic thin-shell hessian with the intrins… ▽ More

    Submitted 17 April, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  27. arXiv:2312.02255  [pdf, other

    cs.CV cs.GR cs.LG

    Re-Nerfing: Improving Novel Views Synthesis through Novel Views Synthesis

    Authors: Felix Tristram, Stefano Gasperini, Nassir Navab, Federico Tombari

    Abstract: Neural Radiance Fields (NeRFs) have shown remarkable novel view synthesis capabilities even in large-scale, unbounded scenes, albeit requiring hundreds of views or introducing artifacts in sparser settings. Their optimization suffers from shape-radiance ambiguities wherever only a small visual overlap is available. This leads to erroneous scene geometry and artifacts. In this paper, we propose Re-… ▽ More

    Submitted 17 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Code will be released upon acceptance

  28. arXiv:2312.01105  [pdf, other

    cs.CV

    S2P3: Self-Supervised Polarimetric Pose Prediction

    Authors: Patrick Ruhkamp, Daoyi Gao, Nassir Navab, Benjamin Busam

    Abstract: This paper proposes the first self-supervised 6D object pose prediction from multimodal RGB+polarimetric images. The novel training paradigm comprises 1) a physical model to extract geometric information of polarized light, 2) a teacher-student knowledge distillation scheme and 3) a self-supervised loss formulation through differentiable rendering and an invertible physical constraint. Both networ… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at IJCV

  29. arXiv:2312.00204  [pdf, other

    cs.CV

    DNS SLAM: Dense Neural Semantic-Informed SLAM

    Authors: Kunyi Li, Michael Niemeyer, Nassir Navab, Federico Tombari

    Abstract: In recent years, coordinate-based neural implicit representations have shown promising results for the task of Simultaneous Localization and Mapping (SLAM). While achieving impressive performance on small synthetic scenes, these methods often suffer from oversmoothed reconstructions, especially for complex real-world scenes. In this work, we introduce DNS SLAM, a novel neural RGB-D semantic SLAM a… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  30. arXiv:2311.18681  [pdf, other

    cs.CV cs.CL

    RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance

    Authors: Chantal Pellegrini, Ege Özsoy, Benjamin Busam, Nassir Navab, Matthias Keicher

    Abstract: Conversational AI tools that can generate and discuss clinically correct radiology reports for a given medical image have the potential to transform radiology. Such a human-in-the-loop radiology assistant could facilitate a collaborative diagnostic process, thus saving time and improving the quality of reports. Towards this goal, we introduce RaDialog, the first thoroughly evaluated and publicly a… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 12 pages, 7 figures

  31. arXiv:2311.11782  [pdf, other

    eess.IV cs.CV cs.LG

    Robust Tumor Segmentation with Hyperspectral Imaging and Graph Neural Networks

    Authors: Mayar Lotfy, Anna Alperovich, Tommaso Giannantonio, Bjorn Barz, Xiaohan Zhang, Felix Holm, Nassir Navab, Felix Boehm, Carolin Schwamborn, Thomas K. Hoffmann, Patrick J. Schuler

    Abstract: Segmenting the boundary between tumor and healthy tissue during surgical cancer resection poses a significant challenge. In recent years, Hyperspectral Imaging (HSI) combined with Machine Learning (ML) has emerged as a promising solution. However, due to the extensive information contained within the spectral domain, most ML approaches primarily classify individual HSI (super-)pixels, or tiles, wi… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 11 pages, 6 figures

  32. arXiv:2311.11125  [pdf, other

    cs.CV

    SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation

    Authors: Yamei Chen, Yan Di, Guangyao Zhai, Fabian Manhardt, Chenyangguang Zhang, Ruida Zhang, Federico Tombari, Nassir Navab, Benjamin Busam

    Abstract: Category-level object pose estimation, aiming to predict the 6D pose and 3D size of objects from known categories, typically struggles with large intra-class shape variation. Existing works utilizing mean shapes often fall short of capturing this variation. To address this issue, we present SecondPose, a novel approach integrating object-specific geometric features with semantic category priors fr… ▽ More

    Submitted 21 March, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 accepted. Code is available at: https://github.com/NOrangeeroli/SecondPose

  33. arXiv:2311.08799  [pdf, other

    cs.RO cs.CV

    EyeLS: Shadow-Guided Instrument Landing System for Intraocular Target Approaching in Robotic Eye Surgery

    Authors: Junjie Yang, Zhihao Zhao, Siyuan Shen, Daniel Zapp, Mathias Maier, Kai Huang, Nassir Navab, M. Ali Nasseri

    Abstract: Robotic ophthalmic surgery is an emerging technology to facilitate high-precision interventions such as retina penetration in subretinal injection and removal of floating tissues in retinal detachment depending on the input imaging modalities such as microscopy and intraoperative OCT (iOCT). Although iOCT is explored to locate the needle tip within its range-limited ROI, it is still difficult to c… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 10 pages

  34. arXiv:2311.05289  [pdf, other

    cs.CV cs.RO

    VoxNeRF: Bridging Voxel Representation and Neural Radiance Fields for Enhanced Indoor View Synthesis

    Authors: Sen Wang, Wei Zhang, Stefano Gasperini, Shun-Cheng Wu, Nassir Navab

    Abstract: Creating high-quality view synthesis is essential for immersive applications but continues to be problematic, particularly in indoor environments and for real-time deployment. Current techniques frequently require extensive computational time for both training and rendering, and often produce less-than-ideal 3D representations due to inadequate geometric structuring. To overcome this, we introduce… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 8 pages, 4 figures

  35. arXiv:2311.04999  [pdf, other

    cs.RO

    Implicit Neural Representations for Breathing-compensated Volume Reconstruction in Robotic Ultrasound

    Authors: Yordanka Velikova, Mohammad Farid Azampour, Walter Simson, Marco Esposito, Nassir Navab

    Abstract: Ultrasound (US) imaging is widely used in diagnosing and staging abdominal diseases due to its lack of non-ionizing radiation and prevalent availability. However, significant inter-operator variability and inconsistent image acquisition hinder the widespread adoption of extensive screening programs. Robotic ultrasound systems have emerged as a promising solution, offering standardized acquisition… ▽ More

    Submitted 3 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  36. arXiv:2311.02247  [pdf, other

    cs.LG

    PRISM: Progressive Restoration for Scene Graph-based Image Manipulation

    Authors: Pavel Jahoda, Azade Farshad, Yousef Yeganeh, Ehsan Adeli, Nassir Navab

    Abstract: Scene graphs have emerged as accurate descriptive priors for image generation and manipulation tasks, however, their complexity and diversity of the shapes and relations of objects in data make it challenging to incorporate them into the models and generate high-quality results. To address these challenges, we propose PRISM, a novel progressive multi-head image manipulation approach to improve the… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  37. arXiv:2309.14538  [pdf, other

    cs.CV

    Dynamic Scene Graph Representation for Surgical Video

    Authors: Felix Holm, Ghazal Ghazaei, Tobias Czempiel, Ege Özsoy, Stefan Saur, Nassir Navab

    Abstract: Surgical videos captured from microscopic or endoscopic imaging devices are rich but complex sources of information, depicting different tools and anatomical structures utilized during an extended amount of time. Despite containing crucial workflow information and being commonly recorded in many procedures, usage of surgical videos for automated surgical workflow understanding is still limited.… ▽ More

    Submitted 24 October, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

  38. arXiv:2309.14492  [pdf, other

    eess.IV cs.CV cs.RO

    AiAReSeg: Catheter Detection and Segmentation in Interventional Ultrasound using Transformers

    Authors: Alex Ranne, Yordanka Velikova, Nassir Navab, Ferdinando Rodriguez y Baena

    Abstract: To date, endovascular surgeries are performed using the golden standard of Fluoroscopy, which uses ionising radiation to visualise catheters and vasculature. Prolonged Fluoroscopic exposure is harmful for the patient and the clinician, and may lead to severe post-operative sequlae such as the development of cancer. Meanwhile, the use of interventional Ultrasound has gained popularity, due to its w… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: This work has been submitted to the IEEE for possible publication

  39. arXiv:2309.12188  [pdf, other

    cs.RO cs.CV

    SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs

    Authors: Guangyao Zhai, Xiaoni Cai, Dianye Huang, Yan Di, Fabian Manhardt, Federico Tombari, Nassir Navab, Benjamin Busam

    Abstract: Object rearrangement is pivotal in robotic-environment interactions, representing a significant capability in embodied AI. In this paper, we present SG-Bot, a novel rearrangement framework that utilizes a coarse-to-fine scheme with a scene graph as the scene representation. Unlike previous methods that rely on either known goal priors or zero-shot large models, SG-Bot exemplifies lightweight, real… ▽ More

    Submitted 24 March, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: ICRA 2024 accepted. Project website: https://sites.google.com/view/sg-bot

  40. arXiv:2309.09563  [pdf, other

    cs.CV

    RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy

    Authors: Mert Asim Karaoglu, Viktoria Markova, Nassir Navab, Benjamin Busam, Alexander Ladikos

    Abstract: Unlike in natural images, in endoscopy there is no clear notion of an up-right camera orientation. Endoscopic videos therefore often contain large rotational motions, which require keypoint detection and description algorithms to be robust to these conditions. While most classical methods achieve rotation-equivariant detection and invariant description by design, many learning-based approaches lea… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 8 pages, 5 figures

  41. arXiv:2309.08927  [pdf, other

    cs.CV

    DynaMoN: Motion-Aware Fast and Robust Camera Localization for Dynamic Neural Radiance Fields

    Authors: Nicolas Schischka, Hannah Schieber, Mert Asim Karaoglu, Melih Görgülü, Florian Grötzner, Alexander Ladikos, Daniel Roth, Nassir Navab, Benjamin Busam

    Abstract: The accurate reconstruction of dynamic scenes with neural radiance fields is significantly dependent on the estimation of camera poses. Widely used structure-from-motion pipelines encounter difficulties in accurately tracking the camera trajectory when faced with separate dynamics of the scene content and the camera movement. To address this challenge, we propose DynaMoN. DynaMoN utilizes semantic… ▽ More

    Submitted 17 March, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: 6 pages, 4 figures

  42. arXiv:2309.02965  [pdf, other

    cs.CV

    Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction

    Authors: Zhiying Leng, Shun-Cheng Wu, Mahdi Saleh, Antonio Montanaro, Hao Yu, Yin Wang, Nassir Navab, Xiaohui Liang, Federico Tombari

    Abstract: Reconstructing both objects and hands in 3D from a single RGB image is complex. Existing methods rely on manually defined hand-object constraints in Euclidean space, leading to suboptimal feature learning. Compared with Euclidean space, hyperbolic space better preserves the geometric properties of meshes thanks to its exponentially-growing space distance, which amplifies the differences between th… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Accpeted by ICCV 2023

    ACM Class: I.4.5

  43. arXiv:2309.01865  [pdf, other

    eess.IV cs.AI

    BigFUSE: Global Context-Aware Image Fusion in Dual-View Light-Sheet Fluorescence Microscopy with Image Formation Prior

    Authors: Yu Liu, Gesine Muller, Nassir Navab, Carsten Marr, Jan Huisken, Tingying Peng

    Abstract: Light-sheet fluorescence microscopy (LSFM), a planar illumination technique that enables high-resolution imaging of samples, experiences defocused image quality caused by light scattering when photons propagate through thick tissues. To circumvent this issue, dualview imaging is helpful. It allows various sections of the specimen to be scanned ideally by viewing the sample from opposing orientatio… ▽ More

    Submitted 3 November, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: paper in MICCAI 2023

  44. arXiv:2309.00372  [pdf, other

    eess.IV cs.CV

    On the Localization of Ultrasound Image Slices within Point Distribution Models

    Authors: Lennart Bastian, Vincent Bürgin, Ha Young Kim, Alexander Baumann, Benjamin Busam, Mahdi Saleh, Nassir Navab

    Abstract: Thyroid disorders are most commonly diagnosed using high-resolution Ultrasound (US). Longitudinal nodule tracking is a pivotal diagnostic protocol for monitoring changes in pathological thyroid morphology. This task, however, imposes a substantial cognitive load on clinicians due to the inherent challenge of maintaining a mental 3D reconstruction of the organ. We thus present a framework for autom… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: ShapeMI Workshop @ MICCAI 2023; 12 pages 2 figures

  45. 3D Adversarial Augmentations for Robust Out-of-Domain Predictions

    Authors: Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari

    Abstract: Since real-world training datasets cannot properly sample the long tail of the underlying data distribution, corner cases and rare out-of-domain samples can severely hinder the performance of state-of-the-art models. This problem becomes even more severe for dense tasks, such as 3D semantic segmentation, where points of non-standard objects can be confidently associated to the wrong class. In this… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 37 pages, 12 figures

  46. arXiv:2308.12679  [pdf, other

    cs.CV cs.LG

    A Continual Learning Approach for Cross-Domain White Blood Cell Classification

    Authors: Ario Sadafi, Raheleh Salehi, Armin Gruber, Sayedali Shetab Boushehri, Pascal Giehr, Nassir Navab, Carsten Marr

    Abstract: Accurate classification of white blood cells in peripheral blood is essential for diagnosing hematological diseases. Due to constantly evolving clinical settings, data sources, and disease classifications, it is necessary to update machine learning classification models regularly for practical real-world use. Such models significantly benefit from sequentially learning from incoming data streams w… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted for publication at workshop on Domain Adaptation and Representation Transfer (DART) in International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2023)

  47. arXiv:2308.12675  [pdf, other

    cs.CV eess.IV

    A Study of Age and Sex Bias in Multiple Instance Learning based Classification of Acute Myeloid Leukemia Subtypes

    Authors: Ario Sadafi, Matthias Hehr, Nassir Navab, Carsten Marr

    Abstract: Accurate classification of Acute Myeloid Leukemia (AML) subtypes is crucial for clinical decision-making and patient care. In this study, we investigate the potential presence of age and sex bias in AML subtype classification using Multiple Instance Learning (MIL) architectures. To that end, we train multiple MIL models using different levels of sex imbalance in the training set and excluding cert… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted for publication at workshop on Fairness of AI in Medical Imaging in International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2023)

  48. arXiv:2308.10627  [pdf, other

    cs.CV

    Polarimetric Information for Multi-Modal 6D Pose Estimation of Photometrically Challenging Objects with Limited Data

    Authors: Patrick Ruhkamp, Daoyi Gao, HyunJun Jung, Nassir Navab, Benjamin Busam

    Abstract: 6D pose estimation pipelines that rely on RGB-only or RGB-D data show limitations for photometrically challenging objects with e.g. textureless surfaces, reflections or transparency. A supervised learning-based method utilising complementary polarisation information as input modality is proposed to overcome such limitations. This supervised approach is then extended to a self-supervised paradigm b… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV 2023 TRICKY Workshop

  49. arXiv:2308.10621  [pdf, other

    cs.CV

    Multi-Modal Dataset Acquisition for Photometrically Challenging Object

    Authors: HyunJun Jung, Patrick Ruhkamp, Nassir Navab, Benjamin Busam

    Abstract: This paper addresses the limitations of current datasets for 3D vision tasks in terms of accuracy, size, realism, and suitable imaging modalities for photometrically challenging objects. We propose a novel annotation and acquisition pipeline that enhances existing 3D perception and 6D object pose datasets. Our approach integrates robotic forward-kinematics, external infrared trackers, and improved… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV 2023 TRICKY Workshop

  50. Robust Monocular Depth Estimation under Challenging Conditions

    Authors: Stefano Gasperini, Nils Morbitzer, HyunJun Jung, Nassir Navab, Federico Tombari

    Abstract: While state-of-the-art monocular depth estimation approaches achieve impressive results in ideal settings, they are highly unreliable under challenging illumination and weather conditions, such as at nighttime or in the presence of rain. In this paper, we uncover these safety-critical issues and tackle them with md4all: a simple and effective solution that works reliably under both adverse and ide… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: ICCV 2023. Source code and data: https://md4all.github.io