Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.09199 (cs)

[Submitted on 14 Mar 2024]

Title: Customizing Segmentation Foundation Model via Prompt Learning for Instance Segmentation

Authors: Hyung-Il Kim, Kimin Yun, Jun-Seok Yun, Yuseok Bae

Abstract: Recently, foundation models trained on massive datasets to adapt to a wide range of domains have attracted considerable attention and are actively being explored within the computer vision community. Among these, the Segment Anything Model (SAM) stands out for its remarkable progress in generalizability and flexibility for image segmentation tasks, achieved through prompt-based object mask generation. However, despite its strengths, SAM faces two key limitations when applied to customized instance segmentation, which targets specific objects or objects in unique environments not typically present in the training data: 1) the ambiguity inherent in input prompts and 2) the necessity for extensive additional training to achieve optimal segmentation. To address these challenges, we propose a novel method, customized instance segmentation via prompt learning tailored to SAM. Our method involves a prompt learning module (PLM), which adjusts input prompts in the embedding space to better align with user intentions, thereby enabling more efficient training. Furthermore, we introduce a point matching module (PMM) to enhance the feature representation for finer segmentation by ensuring detailed alignment with ground truth boundaries. Experimental results on various customized instance segmentation scenarios demonstrate the effectiveness of the proposed method.

Comments:	11 pages, 10 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.09199 [cs.CV]
	(or arXiv:2403.09199v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.09199

Submission history

From: Hyung-Il Kim
[v1] Thu, 14 Mar 2024 09:13:51 UTC (11,817 KB)
