OCR images, PDFs and Google Books
An easy-to-use desktop tool to recognise and extract text from images, PDFs and Google Books. It uses the Tesseract OCR engine, combined with modern and efficient preprocessing and analysis pipelines, to produce high-quality output in plain text, hOCR and searchable PDF formats. The tool was built with a focus on OCR of historical printed works, but it also includes modern language options and works well on modern printed works.
Changes in version 1.3.0
3 months ago
(Built 3 months ago)
Installed size: ~193.42 MiB
Download size: 93.87 MiB
Available architectures: aarch64, x86_64
Installs: 7,040