Scene text removal via cascaded text stroke detection and erasing

The synthetic dataset contains 617,401 training images and 2,000 test images; all training and test samples are resized to 256 × 256. The code for generating the synthetic dataset, and for producing more synthetic text images, is described in "Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, Synthetic Data for Text Localisation in Natural Images, CVPR 2016" and is available at https://github.com/ankush-me/SynthText. All real scene text images are also resized to 256 × 256.

For more details, please refer to our CVM 2021 paper (arXiv: https://arxiv.org/abs/2011.09768).

Requirements

TensorFlow==1.13.1
Python==3.6.13
CUDA==10.0
OpenCV==4.5.1
NumPy

Installation

Clone this repository.

git clone https://github.com/wcq19941215/SceneTextRemoval.git

Running

1. Image Preparation

You can modify the paths of the training set and validation set directories, as well as other hyperparameters, in config.yml. Note that for training, the gt, mask, and image are concatenated into a single image, which is automatically separated again during training.
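
As an illustration of the concatenated layout described above, the sketch below joins three equally sized arrays side by side and splits them again. The horizontal layout, the order (image, mask, gt), and the helper names `concat_triplet` and `split_triplet` are assumptions made for this example only; check data_load.py for the layout the training code actually expects.

```python
import numpy as np


def concat_triplet(image, mask, gt):
    """Join three H x W x C arrays side by side into one H x 3W x C array.

    The (image, mask, gt) order is an assumption for illustration.
    """
    if not (image.shape == mask.shape == gt.shape):
        raise ValueError("image, mask, and gt must share the same shape")
    return np.concatenate([image, mask, gt], axis=1)


def split_triplet(combined):
    """Split an H x 3W x C array back into three H x W x C arrays."""
    if combined.shape[1] % 3 != 0:
        raise ValueError("combined width must be divisible by 3")
    image, mask, gt = np.split(combined, 3, axis=1)
    return image, mask, gt


if __name__ == "__main__":
    size = 256  # all samples are resized to 256 x 256
    image = np.full((size, size, 3), 10, dtype=np.uint8)
    mask = np.full((size, size, 3), 255, dtype=np.uint8)
    gt = np.full((size, size, 3), 20, dtype=np.uint8)
    combined = concat_triplet(image, mask, gt)
    print(combined.shape)
```

With 256 × 256 inputs, the combined array is 256 pixels high and 768 pixels wide under this assumed layout.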

2. Training

Once config.yml is configured, start training by running:

python train.py

3. Testing

Note that test.py only produces the output images; it does not compute evaluation results such as PSNR. Run it as follows:

python test.py \
--image=[the path of test images] \
--mask=[the path of test mask] \
--output=[the path to save output images] \
--checkpoint_dir=[The directory of tensorflow checkpoint]
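
To run the command above over several image folders, a small wrapper can assemble the same arguments. This is a sketch that uses only the four flags shown above; the helper names `build_test_command` and `run_test` and the example paths are hypothetical and not part of this repository.

```python
import subprocess


def build_test_command(image, mask, output, checkpoint_dir, python="python"):
    """Return the argument list for one test.py run, using the flags shown above."""
    return [
        python,
        "test.py",
        "--image=" + image,
        "--mask=" + mask,
        "--output=" + output,
        "--checkpoint_dir=" + checkpoint_dir,
    ]


def run_test(image, mask, output, checkpoint_dir):
    """Run test.py once; raises CalledProcessError if the script exits with an error."""
    subprocess.run(
        build_test_command(image, mask, output, checkpoint_dir), check=True
    )


if __name__ == "__main__":
    # Hypothetical paths, printed only; call run_test(...) to actually launch test.py.
    print(" ".join(build_test_command("data/img", "data/mask", "out", "ckpt")))
```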

For a fair comparison, we use the same evaluation method as EnsNet; you can find the evaluation metrics on this website (PythonCode.zip). You can also use UTILITY.py to compute PSNR and SSIM.
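
For reference, PSNR between an output image and its ground truth follows the standard definition 10 · log10(MAX² / MSE). The sketch below is a minimal NumPy version that assumes 8-bit images (MAX = 255). It is not taken from UTILITY.py or from the EnsNet evaluation code, so its numbers may differ slightly from those scripts; the function name `psnr` is illustrative.

```python
import numpy as np


def psnr(output, target, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two same-shaped images."""
    # Convert to float first so uint8 subtraction cannot wrap around.
    output = np.asarray(output, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    if output.shape != target.shape:
        raise ValueError("output and target must have the same shape")
    mse = np.mean((output - target) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(max_value ** 2 / mse))
```

Higher values indicate that the output is closer to the ground truth; identical images give infinity.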

4. Pretrained models

Please download our pretrained models: TextRomoval (password: 1234).

5. Dataset

The dataset can be obtained by sending us a request email. Specifically, researchers should download and fill out the Agreement Form (Agreement_form.doc) and send it back to Weize Quan (weize.quan AT nlpr.ia.ac.cn; Email title: Scene Text Removal Dataset Request). We will then send you the download instructions at our discretion.

Paper

Please consider citing our paper if you use our dataset:

@article{Bian2021Scence,
  title = {Scene text removal via cascaded text stroke detection and erasing},
  author = {Xuewei Bian and Chaoqun Wang and Weize Quan and Juntao Ye and Xiaopeng Zhang and Dong-Ming Yan},
  publisher = {Computational Visual Media},
  year = {2022},
  journal = {Computational Visual Media},
  volume = {8},
  number = {2},
  numpages = {15},
  keywords = {scene text removal; text stroke detection; generative adversarial networks; cascaded network design; real-world dataset},
  doi = {10.1007/s41095-021-0242-8}
}
