Skip to main content

Showing 1–8 of 8 results for author: Pu, B

  1. arXiv:2406.11333  [pdf, other

    cs.CV

    Hallucination Mitigation Prompts Long-term Video Understanding

    Authors: Yiwei Sun, Zhihang Liu, Chuanbin Liu, Bowei Pu, Zhihan Zhang, Hongtao Xie

    Abstract: Recently, multimodal large language models have made significant advancements in video understanding tasks. However, their ability to understand unprocessed long videos is very limited, primarily due to the difficulty in supporting the enormous memory overhead. Although existing methods achieve a balance between memory and information by aggregating frames, they inevitably introduce the severe hal… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2403.09195  [pdf, other

    cs.CV

    SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration

    Authors: Yanfei Song, Bangzheng Pu, Peng Wang, Hongxu Jiang, Dong Dong, Yongxiang Cao, Yiqing Shen

    Abstract: Segment Anything Model (SAM) has garnered significant attention in segmentation tasks due to their zero-shot generalization ability. However, a broader application of SAMs to real-world practice has been restricted by their low inference speed and high computational memory demands, which mainly stem from the attention mechanism. Existing work concentrated on optimizing the encoder, yet has not ade… ▽ More

    Submitted 17 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  3. arXiv:2308.07717  [pdf, other

    cs.CV

    Real-time Automatic M-mode Echocardiography Measurement with Panel Attention from Local-to-Global Pixels

    Authors: Ching-Hsun Tseng, Shao-Ju Chien, Po-Shen Wang, Shin-Jye Lee, Wei-Huan Hu, Bin Pu, Xiao-jun Zeng

    Abstract: Motion mode (M-mode) recording is an essential part of echocardiography to measure cardiac dimension and function. However, the current diagnosis cannot build an automatic scheme, as there are three fundamental obstructs: Firstly, there is no open dataset available to build the automation for ensuring constant results and bridging M-mode echocardiography with real-time instance segmentation (RIS);… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  4. arXiv:2303.09858  [pdf, other

    eess.IV cs.CR cs.CV cs.MM

    Preventing Unauthorized AI Over-Analysis by Medical Image Adversarial Watermarking

    Authors: Xingxing Wei, Bangzheng Pu, Shiji Zhao, Chen Chi, Huazhu Fu

    Abstract: The advancement of deep learning has facilitated the integration of Artificial Intelligence (AI) into clinical practices, particularly in computer-aided diagnosis. Given the pivotal role of medical images in various diagnostic procedures, it becomes imperative to ensure the responsible and secure utilization of AI techniques. However, the unauthorized utilization of AI for image analysis raises si… ▽ More

    Submitted 13 September, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

  5. arXiv:2211.01671  [pdf, other

    cs.CV

    Visually Adversarial Attacks and Defenses in the Physical World: A Survey

    Authors: Xingxing Wei, Bangzheng Pu, Jiefan Lu, Baoyuan Wu

    Abstract: Although Deep Neural Networks (DNNs) have been widely applied in various real-world scenarios, they are vulnerable to adversarial examples. The current adversarial attacks in computer vision can be divided into digital attacks and physical attacks according to their different attack forms. Compared with digital attacks, which generate perturbations in the digital pixels, physical attacks are more… ▽ More

    Submitted 13 July, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  6. arXiv:2206.12340  [pdf

    cs.HC cs.SD eess.AS

    How to hide your voice: Noise-cancelling bird photography blind

    Authors: Caner Baydur, Baojing Pu, Xiaoqing Xu

    Abstract: Getting close to birds is a great challenge in wildlife photography. Bird photography blinds may be the most effective and least intrusive way if properly designed. However, the acoustic design of the blinds has been overlooked so far. Herein, we present noise-cancelling blinds which allow photographing birds at close range. Firstly, we conduct a questionnaire in the eco-tourism centre located in… ▽ More

    Submitted 27 November, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: 26 pages, 11 figures. Revised argument in sections 2 and 4, results unchanged, references added

    Journal ref: Environmental Science and Pollution Research, 1-14; 2023

  7. arXiv:2106.10693  [pdf, other

    cs.LG cs.AI

    Fast PDN Impedance Prediction Using Deep Learning

    Authors: Ling Zhang, Jack Juang, Zurab Kiguradze, Bo Pu, Shuai Jin, Songping Wu, Zhiping Yang, Chulsoon Hwang

    Abstract: Modeling and simulating a power distribution network (PDN) for printed circuit boards (PCBs) with irregular board shapes and multi-layer stackup is computationally inefficient using full-wave simulations. This paper presents a new concept of using deep learning for PDN impedance prediction. A boundary element method (BEM) is applied to efficiently calculate the impedance for arbitrary board shape… ▽ More

    Submitted 20 June, 2021; originally announced June 2021.

  8. arXiv:1905.00565  [pdf, other

    cs.DC

    Parallelizing Convergent Cross Mapping Using Apache Spark

    Authors: Bo Pu, Lujie Duan, Nathaniel Osgood

    Abstract: Identifying the causal relationships between subjects or variables remains an important problem across various scientific fields. This is particularly important but challenging in complex systems, such as those involving human behavior, sociotechnical contexts, and natural ecosystems. By exploiting state space reconstruction via lagged embedding of time series, convergent cross mapping (CCM) serve… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

    Comments: 11 pages, 5 figures, SBP conference paper