Showing 1–8 of 8 results for author: Zhuge, Y

  1. arXiv:2407.07523  [pdf, other]

    cs.CV cs.MM

    SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning

    Authors: Haiwen Diao, Bo Wan, Xu Jia, Yunzhi Zhuge, Ying Zhang, Huchuan Lu, Long Chen

    Abstract: Parameter-efficient transfer learning (PETL) has emerged as a flourishing research field for adapting large pre-trained models to downstream tasks, greatly reducing trainable parameters while grappling with memory challenges during fine-tuning. To address it, memory-efficient series (METL) avoid backpropagating gradients through the large backbone. However, they compromise by exclusively relying o…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 23 pages, 11 figures, Accepted by ECCV2024

  2. arXiv:2403.11549  [pdf, other]

    cs.CV

    Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

    Authors: Jiazuo Yu, Yunzhi Zhuge, Lu Zhang, Ping Hu, Dong Wang, Huchuan Lu, You He

    Abstract: Continual learning can empower vision-language models to continuously acquire new knowledge, without the need for access to the entire historical dataset. However, mitigating the performance degradation in large-scale models is non-trivial due to (i) parameter shifts throughout lifelong learning and (ii) significant computational burdens associated with full-model tuning. In this work, we present…

    Submitted 3 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: This work is accepted by CVPR2024. More modifications may be performed

  3. arXiv:2401.15975  [pdf, other]

    cs.CV

    StableIdentity: Inserting Anybody into Anywhere at First Sight

    Authors: Qinghe Wang, Xu Jia, Xiaomin Li, Taiqing Li, Liqian Ma, Yunzhi Zhuge, Huchuan Lu

    Abstract: Recent advances in large pretrained text-to-image models have shown unprecedented capabilities for high-quality human-centric generation, however, customizing face identity is still an intractable problem. Existing methods cannot ensure stable identity preservation and flexible editability, even with several images for each subject during training. In this work, we propose StableIdentity, which al…

    Submitted 29 January, 2024; originally announced January 2024.

  4. arXiv:2307.12616  [pdf, other]

    cs.CV cs.AI

    CTVIS: Consistent Training for Online Video Instance Segmentation

    Authors: Kaining Ying, Qing Zhong, Weian Mao, Zhenhua Wang, Hao Chen, Lin Yuanbo Wu, Yifan Liu, Chengxiang Fan, Yunzhi Zhuge, Chunhua Shen

    Abstract: The discrimination of instance embeddings plays a vital role in associating instances across time for online video instance segmentation (VIS). Instance embedding learning is directly supervised by the contrastive loss computed upon the contrastive items (CIs), which are sets of anchor/positive/negative embeddings. Recent online VIS methods leverage CIs sourced from one reference frame only, which…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023. The code is available at https://github.com/KainingYing/CTVIS

  5. arXiv:1909.04161  [pdf, other]

    cs.CV

    Joint Learning of Saliency Detection and Weakly Supervised Semantic Segmentation

    Authors: Yu Zeng, Yunzhi Zhuge, Huchuan Lu, Lihe Zhang

    Abstract: Existing weakly supervised semantic segmentation (WSSS) methods usually utilize the results of pre-trained saliency detection (SD) models without explicitly modeling the connections between the two tasks, which is not the most efficient configuration. Here we propose a unified multi-task learning framework to jointly solve WSSS and SD using a single network, i.e., saliency, and segmentation network…

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted by ICCV19

  6. arXiv:1904.00566  [pdf, other]

    cs.CV

    Multi-source weak supervision for saliency detection

    Authors: Yu Zeng, Yunzhi Zhuge, Huchuan Lu, Lihe Zhang, Mingyang Qian, Yizhou Yu

    Abstract: The high cost of pixel-level annotations makes it appealing to train saliency detection models with weak supervision. However, a single weak supervision source usually does not contain enough information to train a well-performing model. To this end, we propose a unified framework to train saliency detection models with diverse weak supervision sources. In this paper, we use category labels, capti…

    Submitted 1 April, 2019; originally announced April 2019.

    Comments: cvpr2019

  7. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  8. Boundary-guided Feature Aggregation Network for Salient Object Detection

    Authors: Yunzhi Zhuge, Pingping Zhang, Huchuan Lu

    Abstract: Fully convolutional networks (FCNs) have significantly improved the performance of many pixel-labeling tasks, such as semantic segmentation and depth estimation. However, it still remains non-trivial to thoroughly utilize the multi-level convolutional feature maps and boundary information for salient object detection. In this paper, we propose a novel FCN framework to integrate multi-level convoluti…

    Submitted 27 September, 2018; originally announced September 2018.

    Comments: To appear in Signal Processing Letters (SPL), 5 pages, 5 figures and 3 tables