Skip to main content

Showing 1–7 of 7 results for author: Leng, H

  1. arXiv:2308.13794  [pdf, other

    cs.CV

    SOGDet: Semantic-Occupancy Guided Multi-view 3D Object Detection

    Authors: Qiu Zhou, Jinming Cao, Hanchao Leng, Yifang Yin, Yu Kun, Roger Zimmermann

    Abstract: In the field of autonomous driving, accurate and comprehensive perception of the 3D environment is crucial. Bird's Eye View (BEV) based methods have emerged as a promising solution for 3D object detection using multi-view images as input. However, existing 3D object detection methods often ignore the physical context in the environment, such as sidewalk and vegetation, resulting in sub-optimal per… ▽ More

    Submitted 6 January, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted by AAAI2024

  2. arXiv:2308.05784  [pdf, other

    eess.IV cs.CV

    High-performance Data Management for Whole Slide Image Analysis in Digital Pathology

    Authors: Haoju Leng, Ruining Deng, Shunxing Bao, Dazheng Fang, Bryan A. Millis, Yucheng Tang, Haichun Yang, Xiao Wang, Yifan Peng, Lipeng Wan, Yuankai Huo

    Abstract: When dealing with giga-pixel digital pathology in whole-slide imaging, a notable proportion of data records holds relevance during each analysis operation. For instance, when deploying an image analysis algorithm on whole-slide images (WSI), the computational bottleneck often lies in the input-output (I/O) system. This is particularly notable as patch-level processing introduces a considerable I/O… ▽ More

    Submitted 20 August, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

  3. arXiv:2306.02923  [pdf, other

    cs.CL

    MidMed: Towards Mixed-Type Dialogues for Medical Consultation

    Authors: Xiaoming Shi, Zeming Liu, Chuan Wang, Haitao Leng, Kui Xue, Xiaofan Zhang, Shaoting Zhang

    Abstract: Most medical dialogue systems assume that patients have clear goals (medicine querying, surgical operation querying, etc.) before medical consultation. However, in many real scenarios, due to the lack of medical knowledge, it is usually difficult for patients to determine clear goals with all necessary slots. In this paper, we identify this challenge as how to construct medical consultation dialog… ▽ More

    Submitted 13 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL 2023 main conference. The first two authors contributed equally to this work

  4. arXiv:2305.14566  [pdf, other

    eess.IV cs.CV

    An Accelerated Pipeline for Multi-label Renal Pathology Image Segmentation at the Whole Slide Image Level

    Authors: Haoju Leng, Ruining Deng, Zuhayr Asad, R. Michael Womick, Haichun Yang, Lipeng Wan, Yuankai Huo

    Abstract: Deep-learning techniques have been used widely to alleviate the labour-intensive and time-consuming manual annotation required for pixel-level tissue characterization. Our previous study introduced an efficient single dynamic network - Omni-Seg - that achieved multi-class multi-scale pathological segmentation with less computational complexity. However, the patch-wise segmentation paradigm still a… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  5. arXiv:2302.12963  [pdf

    cs.NE

    A Surrogate-Assisted Highly Cooperative Coevolutionary Algorithm for Hyperparameter Optimization in Deep Convolutional Neural Network

    Authors: An Chen, Zhigang Ren, Muyi Wang, Hui Chen, Haoxi Leng, Shuai Liu

    Abstract: Convolutional neural networks (CNNs) have gained remarkable success in recent years. However, their performance highly relies on the architecture hyperparameters, and finding proper hyperparameters for a deep CNN is a challenging optimization problem owing to its high-dimensional and computationally expensive characteristics. Given these difficulties, this study proposes a surrogate-assisted highl… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  6. arXiv:2212.05489  [pdf, other

    cs.HC

    AliCHI: A Large-scale Multi-modal Dataset and Automated Evaluation Tool for Human-like Dialogue Systems

    Authors: Zhiling Luo, Qiankun Shi, Sha Zhao, Wei Zhou, Haiqing Chen, Yuankai Ma, Haitao Leng

    Abstract: A well-designed interactive human-like dialogue system is expected to take actions (e.g. smiling) and respond in a pattern similar to humans. However, due to the limitation of single-modality (only speech) or small volume of currently public datasets, most dialogue systems can only respond in speech and cannot take human-like actions. In this work, we build a large-scale multi-modal dataset of hum… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  7. arXiv:2108.10528  [pdf, other

    cs.CV

    ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation

    Authors: Jinming Cao, Hanchao Leng, Dani Lischinski, Danny Cohen-Or, Changhe Tu, Yangyan Li

    Abstract: RGB-D semantic segmentation has attracted increasing attention over the past few years. Existing methods mostly employ homogeneous convolution operators to consume the RGB and depth features, ignoring their intrinsic differences. In fact, the RGB values capture the photometric appearance properties in the projected image space, while the depth feature encodes both the shape of a local geometry as… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: ICCV2021