GitHub Readme.md
This package provide a custom Heroku buildpack providing the Tesseract OCR binary and all the required libraries to Heroku apps. Training data for English language is provided by default (can be configured).
The first step consists in allowing your Heroku app to use multiple buildpacks. Heroku natively supports multiple buildpacks per app.
setup your app as
heroku buildpacks:add --index 1 https://github.com/cofacts/heroku-buildpack-tesseract
heroku buildpacks:set heroku/LANG
where LANG
is the language used by your app (e.g., ruby
, python
, or nodejs
). A complete list of Heroku buildpacks can be found here.
Note : You should make sure
heroku/nodejs
is initilized(execution order) afterheroku-buildpack-tesseract
, or npm automatically run will not work.
If you want Tesseract to be able to work with any other languages than English, set the environment variable TESSERACT_OCR_LANGUAGES
to a comma-separated string of ISO 639-2 language codes.
$ heroku config:set TESSERACT_OCR_LANGUAGES="chi_tra"
Push your code to Heroku
You can use the tesseract
binary in your Heroku app!
This fork upgrades Tesseract binary version from 3.04.01 to 4.0
MIT License.
Original work Copyright (c) 2013 Marco Azimonti
Modified work Copyright (c) 2015 Matteo Maggioni
Modified work Copyright (c) 2017 Oswell Chan
Modified work Copyright (c) 2018
Copy the snippet above into CLI.