InstallingTesseract.md

Linux

Follow the instructions here

Windows

The Tesseract developers recommend building with cmake. However, we have found that differences between the compiler tools that ship with Rtools can cause issues, and MinGW is out of date, so our recommended approach for Windows is to use MSYS2.

Using MSYS2

Prep

MSYS2 ported the pacman package manager from Arch Linux, which has a simple syntax for searching for and installing packages.

  1. Install MSYS2 from here and follow their getting started steps.

  2. Install the development tools needed to build software from source by running:

pacman -S base-devel

  3. Install leptonica:

pacman -S leptonica

  4. Update the pkg-config search path:

export PKG_CONFIG_PATH=/mingw64/lib/pkgconfig/
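
The plain export above replaces any pkg-config path that is already set. As a minimal sketch, assuming you would rather keep existing entries, the helper below (the function name `add_pkg_config_dir` is ours, not part of MSYS2) prepends the directory only when it is missing, then asks pkg-config whether it can see Leptonica. We assume the Leptonica pkg-config module is named `lept`; adjust if your install names it differently.

```shell
# Hypothetical helper: prepend a directory to PKG_CONFIG_PATH unless that
# exact string is already one of its colon-separated entries.
add_pkg_config_dir() {
    dir="$1"
    case ":${PKG_CONFIG_PATH:-}:" in
        *":$dir:"*) ;;  # already listed, nothing to do
        *) PKG_CONFIG_PATH="${dir}${PKG_CONFIG_PATH:+:$PKG_CONFIG_PATH}" ;;
    esac
    export PKG_CONFIG_PATH
}

add_pkg_config_dir /mingw64/lib/pkgconfig/

# Optional check: print the Leptonica version if pkg-config can find it.
# The module name "lept" is an assumption about how Leptonica was packaged.
if command -v pkg-config >/dev/null 2>&1; then
    pkg-config --modversion lept || echo "lept not found on PKG_CONFIG_PATH"
fi
```

Calling the helper a second time with the same directory leaves the variable unchanged, so it is safe to put in a shell startup file.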

Build package

From the directory containing the PKGBUILD file, run the following command to build Tesseract:

makepkg -sri
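
For readers unfamiliar with makepkg, the short flags `-sri` can be spelled out with long options. The sketch below wraps the same command in a small function (the name `build_pkg` is ours) that first checks for makepkg and a PKGBUILD in the current directory, so a missing prerequisite produces a clear message instead of a confusing failure.

```shell
# Hypothetical wrapper around the makepkg call shown above.
build_pkg() {
    if ! command -v makepkg >/dev/null 2>&1; then
        echo "makepkg not found; run this inside an MSYS2 shell" >&2
        return 127
    fi
    if [ ! -f PKGBUILD ]; then
        echo "no PKGBUILD in $(pwd)" >&2
        return 1
    fi
    # --syncdeps (-s): install missing dependencies with pacman
    # --rmdeps   (-r): remove those dependencies again after the build
    # --install  (-i): install the built package
    makepkg --syncdeps --rmdeps --install
}

# Usage, from the directory that holds the PKGBUILD:
#   build_pkg
```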

Dependencies:

Dependencies should be installed automatically by the makepkg command. If they are not, they can be installed manually using pacman.

In MSYS2, package names carry a prefix string that makes them difficult to guess. Additionally, some packages are named slightly differently from their Linux versions. To find the name of a dependency so that you can install it with pacman, search for it using a regular expression:

pacman -Ss <regex> 
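
As an illustration of the search step, the sketch below looks up Leptonica by name. The helper name `search_pkg` is ours; it uses `-q` so that only package names are printed. On a 64-bit MinGW environment we would expect names of the form `mingw-w64-x86_64-<package>`, but treat that as an assumption and use whatever the search actually returns.

```shell
# Hypothetical helper: list the names of packages matching a regex.
# -Ss searches the sync databases; -q limits the output to package names.
search_pkg() {
    if ! command -v pacman >/dev/null 2>&1; then
        echo "pacman not found; run this inside an MSYS2 shell" >&2
        return 127
    fi
    pacman -Ssq "$1"
}

# Example: find every package whose name or description matches "leptonica".
search_pkg 'leptonica' || true

# Then install the name reported by the search, for example:
#   pacman -S <name-from-search>
```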

If the installed binaries are not found afterwards, add the MinGW bin directory to your PATH:

export PATH=$PATH:/mingw64/bin

Install with cmake

I generally followed the instructions here:

https://github.com/UB-Mannheim/tesseract/blob/master/INSTALL.GIT.md

The one modification is the command for building Tesseract, which should be:

cmake .. -DLeptonica_DIR=/path/to/lept/build/bin
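
For context, the `..` in the command above implies it is run from a build subdirectory of the Tesseract source tree. A minimal sketch of that sequence follows, assuming an out-of-source build in `build/`. The function name `build_tesseract` and the `LEPT_DIR` variable are ours, and the Leptonica path is the same placeholder as above, which you must replace with your own.

```shell
# Placeholder path copied from the command above; set LEPT_DIR to override.
LEPT_DIR="${LEPT_DIR:-/path/to/lept/build/bin}"

# Hypothetical helper: configure and build Tesseract in <source>/build.
# The body runs in a subshell so the caller's working directory is unchanged.
build_tesseract() (
    src="$1"
    mkdir -p "$src/build" || exit 1
    cd "$src/build" || exit 1
    # Configure, pointing cmake at the Leptonica build
    cmake .. -DLeptonica_DIR="$LEPT_DIR" || exit 1
    # Compile with whichever generator cmake selected
    cmake --build .
)

# Usage:
#   build_tesseract /path/to/tesseract
```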



duncantl/Rtesseract documentation built on March 25, 2022, 5:50 a.m.