Working with Tesseract OCR – Ubuntu

Note : Until this tag is removed , the blog is not complete . Please donot follow it

Step 1 : Install Tesseract using the command

sudo apt-get install tesseract-ocr

Step 2 : Install the following dependencies

sudo apt-get install autoconf automake libtool
sudo apt-get install libpng12-dev
sudo apt-get install libjpeg62-dev
sudo apt-get install libtiff4-dev
sudo apt-get install zlib1g-dev
sudo apt-get install libicu-dev      # (if you plan to make the training tools)
sudo apt-get install libpango1.0-dev # (if you plan to make the training tools)
sudo apt-get install libcairo2-dev   # (if you plan to make the training tools

Some useful Links :

1.http://blog.cedric.ws/how-to-train-tesseract-301

2.How to add  new fonts in training phase (small but presice)

http://michaeljaylissner.com/posts/2012/02/11/adding-new-fonts-to-tesseract-3-ocr-engine/

3.Training procedure  given in google site

https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3

Advertisements