Software Optical Character Recognition Applications Linux gimagereader sudo pacman -s gimagereader-qt (there are also gtk versions swap -qt for -gtk) You will be prompted to select a dataset based on your language. The table that provides a key can be found on this page. If you are looking for English search for the -eng option. https://tesseract-ocr.github.io/tessdoc/Data-Files.html Further Reading https://www.baeldung.com/linux/ocr-tools