Text detector based on PixelLink architecture with MobileNetV2-like as a backbone for indoor/outdoor scenes.
Metric | Value |
---|---|
F-measure (Harmonic mean of precision and recall on ICDAR2015) | 82.12% |
GFlops | 51.256 |
MParams | 6.747 |
Source framework | TensorFlow* |
Image, name: input
, shape: 1, 3, 768, 1280
in the format B, C, H, W
, where:
B
- batch sizeC
- number of channelsH
- image heightW
- image width
Expected color order: BGR
.
-
name:
model/link\_logits\_/add
, shape:1, 16, 192, 320
- logits related to linkage between pixels and their neighbors. -
name:
model/segm\_logits/add
, shape:1, 2, 192, 320
- logits related to text/no-text classification for each pixel.
Refer to PixelLink and demos for details.
[*] Other names and brands may be claimed as the property of others.