【CVPR 2017】Detecting Oriented Text in Natural Images by Linking Segments

Date: 2017-07-30

SegLink is a bottom-up text detection algorithm. Most state-of-the-art text detection methods are specific to horizontal Latin text and are not fast enough for real-time applications. SegLink decomposes text into two locally detectable elements, namely segments and links. A segment is an oriented box covering part of a word or text line; a link connects two adjacent segments, indicating that they belong to the same word or text line. Both elements are detected by a neural network. Final detections are produced by combining the segments that are connected by links. Its strong performance on horizontal, oriented, and multilingual text datasets demonstrates that SegLink is accurate, fast, and flexible.
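The combining step described above can be pictured as finding connected components in a graph whose nodes are segments and whose edges are links. The following is a minimal Python sketch of that grouping idea only. It assumes segments are referred to by integer index and that link predictions have already been thresholded into a list of index pairs; the function name `group_segments` and this input format are illustrative assumptions, not taken from the SegLink code. Merging each group into a single oriented box is not shown.

```python
from collections import defaultdict


def group_segments(num_segments, links):
    """Group segment indices into connected components defined by links.

    num_segments: number of detected segments (hypothetical input format).
    links: iterable of (i, j) index pairs for segments predicted as linked.
    """
    parent = list(range(num_segments))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for a, b in links:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = defaultdict(list)
    for i in range(num_segments):
        groups[find(i)].append(i)
    return sorted(groups.values())


# Hypothetical example: five segments, with links joining 0-1-2 and 3-4.
print(group_segments(5, [(0, 1), (1, 2), (3, 4)]))  # [[0, 1, 2], [3, 4]]
```

Each returned group corresponds to one candidate word or text line. A segment with no links forms a group of its own.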

Paper: https://openaccess.thecvf.com/content_cvpr_2017/papers/Shi_Detecting_Oriented_Text_CVPR_2017_paper.pdf

Code: https://github.com/bgshih/seglink

Published in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017

