Rogue Scholar

PackagesTesseractTech NotesComputer and Information Sciences

Tesseract Update: Options and Languages

Published December 8, 2016 in rOpenSci - open tools for open science

Author Jeroen Ooms

A few weeks ago we announced the first release of the tesseract package: a high quality OCR engine in R. We have now released an update with extra features. Installing Training Data As explained in the first post, the tesseract system is powered by language specific training data. By default only English training data is installed. Version 1.3 adds utilities to make it easier to install additional training data.