The script extracts text from images using Tesseract OCR and saves it to a text file.
This has been written in Python and uses the pytesseract and Pillow libraries.
I created this to extract texts from my Reddit screenshots :P
- Tesseract OCR:
- Install Tesseract OCR and its language data files.
- Python Libraries:
pytesseract: A Python wrapper for Tesseract OCR.Pillow: Python Imaging Library which will be used for opening and manipulating images.
sudo pacman -S tesseract tesseract-data-engsudo apt-get install tesseract-ocrbrew install tesseractpip install pytesseract pillowmkdir imagepython3 script.py