TesseractOCR by: Charles Weld & Kees van Spelde
  • 202 total downloads
  • Latest version: 5.3.5
  • Tesseract OCR text readable PDF
Tesseract 5.3.1 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository.