Easy OCR Training

This repository contains the code for training Easy OCR model on custom dataset.

Installation

Install the required packages using the following command:

pip install -r requirements.txt

Usage

Prepare the dataset

In this case, we have dataset from KEC marksheets. The training data is in train_data folder and the validation data is in val_data folder.

Each folder has a labels.txt file which contains the labels for the images in the folder.

Distribution

There are 5500 (91.92%) images in the training dataset and 483 (8.08%) images in the validation dataset. The ratio is 11.38:1.

Create LMDB dataset

Before creating the dataset, update the create_lmdb_dataset.py to have below on line 47:

From:

imagePath, label = datalist[i].strip('\n').split('\t')
imagePath = os.path.join(inputPath, imagePath)

To:

imagePath, label = datalist[i].strip('\n').split('.jpg,')
imagePath += '.jpg'
imagePath = os.path.join(inputPath, imagePath)

For training dataset

python create_lmdb_dataset.py train_data train_data/labels.txt train_lmdb

For validation dataset

python create_lmdb_dataset.py val_data val_data/labels.txt val_lmdb

Obtaining a pre-trained model

For this step, we use a pretrained model which we can fine-tune on. We can download them from here.

In this project, I used TPS-ResNet-BiLSTM-Attn.pth and placed it in models folder.

Train the model

Now, we can train the model using the following command:

python train.py --train_data train_lmdb --valid_data val_lmdb --select_data "/" --batch_ratio 1.0 --Transformation TPS --FeatureExtraction ResNet --SequenceModeling BiLSTM --Prediction Attn --saved_model models/TPS-ResNet-BiLSTM-Attn.pth --batch_size 8 --data_filtering_off --workers 4 --batch_max_length 80 --num_iter 10 --valInterval 5 --FT

Name		Name	Last commit message	Last commit date
Latest commit History 177 Commits
dataset		dataset
models		models
modules		modules
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
create_lmdb_dataset.py		create_lmdb_dataset.py
dataset.py		dataset.py
demo.ipynb		demo.ipynb
demo.py		demo.py
model.py		model.py
requirements.txt		requirements.txt
separate-dataset.py		separate-dataset.py
test.py		test.py
train.ipynb		train.ipynb
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Easy OCR Training

Installation

Usage

Prepare the dataset

Distribution

Create LMDB dataset

For training dataset

For validation dataset

Obtaining a pre-trained model

Train the model

About

Releases

Packages

Languages

License

achyutkneupane/deep-text-recognition-benchmark

Folders and files

Latest commit

History

Repository files navigation

Easy OCR Training

Installation

Usage

Prepare the dataset

Distribution

Create LMDB dataset

For training dataset

For validation dataset

Obtaining a pre-trained model

Train the model

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages