Improving Document Binarization via Adversarial Noise-Texture Augmentation (ICIP 2019)

ankanbhunia/AdverseBiNet

Improving Document Binarization via Adversarial Noise-Texture Augmentation [paper] [ICIP 2019]

This repository contains the full source code and instructions for running it on the datasets described in the paper. The paper revisits the binarization problem by introducing an adversarial learning approach.

The most significant contribution of our framework is that it does not require any paired data, unlike other deep learning-based methods [Ronneberger et al., Vo et al., Konwer et al.]. To our knowledge, such an approach has not been implemented before, making it the first of its kind in the Document Image Analysis community.

[Figure: Results]

In this paper, we propose a two-stage network that first learns to augment the document images by using neural style transfer technique. For this purpose, we construct a Texture Augmentation Network that transfers the texture element of a degraded reference document image to a clean binary image.

[Figure: Architecture]

In this way, the network creates multiple versions of the same textual content with various noisy textures, enlarging the available document binarization datasets. Finally, the newly generated images are passed through a Binarization Network to recover the clean version.
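The data flow of the augmentation stage can be illustrated with a small sketch. This is not the repository's implementation: `toy_texture_transfer` is a hypothetical stand-in for the Texture Augmentation Network (a simple pixel-wise blend, not a learned style transfer), and `augment`, `versions_per_image`, and `seed` are names introduced here for illustration. It assumes clean images are float arrays with 1 for background and 0 for text, and that references have the same shape and values in [0, 1].

```python
import numpy as np


def toy_texture_transfer(clean_binary, degraded_ref):
    """Hypothetical stand-in for the Texture Augmentation Network.

    Background pixels (1 in the clean image) take the reference texture;
    text pixels (0) become a darkened copy of the reference texture.
    """
    clean = clean_binary.astype(np.float32)
    ref = degraded_ref.astype(np.float32)
    return clean * ref + (1.0 - clean) * 0.2 * ref


def augment(clean_images, degraded_refs, versions_per_image, seed=0):
    """Build (noisy input, clean target) pairs from unpaired data.

    Each clean image is combined with randomly chosen, unrelated degraded
    references, giving several noisy versions of the same text.
    """
    rng = np.random.RandomState(seed)
    pairs = []
    for clean in clean_images:
        for _ in range(versions_per_image):
            ref = degraded_refs[rng.randint(len(degraded_refs))]
            pairs.append((toy_texture_transfer(clean, ref), clean))
    return pairs
```

Each returned pair could then serve as one training example for a binarization model, with the generated image as input and the original clean image as the target.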

Pre-requisites

  • Python 2.7
  • TensorFlow
  • OpenCV
  • matplotlib

Training

  • A total of 9 datasets are used in this work: DIBCO 2009, DIBCO 2011, DIBCO 2013, H-DIBCO 2010, H-DIBCO 2012, H-DIBCO 2014, Bickley diary, PHIDB, and S-MS datasets.
  • Of these, the DIBCO 2013 dataset is held out for testing. The remaining datasets are used as the training set.
  • We convert the images from these datasets to patches of size 256 × 256.
  • Download the VGG weights from here and put them in the repository folder.
  • To train the model, run the 'train.py' file.
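The patch-conversion step above can be sketched as follows. This is a minimal illustration, not the preprocessing script used by the authors: the function name `extract_patches`, the non-overlapping tiling, and the white (255) padding of the bottom and right edges are all assumptions, and NumPy is assumed to be available as a dependency of TensorFlow and OpenCV.

```python
import numpy as np


def extract_patches(image, patch_size=256):
    """Split a 2-D grayscale image into non-overlapping square patches.

    The bottom and right edges are padded with white (255) so that both
    dimensions become multiples of patch_size (an assumed policy).
    """
    h, w = image.shape
    pad_h = (-h) % patch_size
    pad_w = (-w) % patch_size
    padded = np.pad(image, ((0, pad_h), (0, pad_w)),
                    mode='constant', constant_values=255)
    rows = padded.shape[0] // patch_size
    cols = padded.shape[1] // patch_size
    # Reshape into a grid of tiles, then flatten the grid into a batch.
    tiles = padded.reshape(rows, patch_size, cols, patch_size)
    return tiles.swapaxes(1, 2).reshape(-1, patch_size, patch_size)
```

An image loaded with `cv2.imread(path, cv2.IMREAD_GRAYSCALE)` could be passed directly to this function; the corresponding ground-truth image would need the same tiling so that inputs and targets stay aligned.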

Citation

If you find this code useful in your research, please consider citing:

@article{bhunia2018improving,
  title={Improving Document Binarization via Adversarial Noise-Texture Augmentation},
  author={Bhunia, Ankan Kumar and Bhunia, Ayan Kumar and Sain, Aneeshan and Roy, Partha Pratim},
  journal={arXiv preprint arXiv:1810.11120},
  year={2018}
}
