Computer Science > Computer Vision and Pattern Recognition

arXiv:2307.13310 (cs)

[Submitted on 25 Jul 2023]

Title:CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer

Authors:Zhiwen Shao, Yuchen Su, Yong Zhou, Fanrong Meng, Hancheng Zhu, Bing Liu, Rui Yao

View PDF

Abstract:Contour based scene text detection methods have rapidly developed recently, but still suffer from inaccurate frontend contour initialization, multi-stage error accumulation, or deficient local information aggregation. To tackle these limitations, we propose a novel arbitrary-shaped scene text detection framework named CT-Net by progressive contour regression with contour transformers. Specifically, we first employ a contour initialization module that generates coarse text contours without any post-processing. Then, we adopt contour refinement modules to adaptively refine text contours in an iterative manner, which are beneficial for context information capturing and progressive global contour deformation. Besides, we propose an adaptive training strategy to enable the contour transformers to learn more potential deformation paths, and introduce a re-score mechanism that can effectively suppress false positives. Extensive experiments are conducted on four challenging datasets, which demonstrate the accuracy and efficiency of our CT-Net over state-of-the-art methods. Particularly, CT-Net achieves F-measure of 86.1 at 11.2 frames per second (FPS) and F-measure of 87.8 at 10.1 FPS for CTW1500 and Total-Text datasets, respectively.

Comments:	This paper has been accepted by IEEE Transactions on Circuits and Systems for Video Technology
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2307.13310 [cs.CV]
	(or arXiv:2307.13310v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.13310
Related DOI:	https://doi.org/10.1109/TCSVT.2023.3299087

Submission history

From: Zhiwen Shao [view email]
[v1] Tue, 25 Jul 2023 08:00:40 UTC (12,066 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators