OCR Text Extraction Script

The script extracts text from images using Tesseract OCR and saves it to a text file. This has been written in Python and uses the pytesseract and Pillow libraries.

I created this to extract texts from my Reddit screenshots :P

Requirements

Tesseract OCR:
- Install Tesseract OCR and its language data files.
Python Libraries:
- pytesseract: A Python wrapper for Tesseract OCR.
- Pillow: Python Imaging Library which will be used for opening and manipulating images.

Installation

Step 1: Install Tesseract OCR

On Arch Linux:

sudo pacman -S tesseract tesseract-data-eng

On Ubuntu/Debian

sudo apt-get install tesseract-ocr

On macOS (Using homebrew)

brew install tesseract

Step 2: Install Python Libraries Using pip

pip install pytesseract pillow

Usage

Step 1: create an image directory

mkdir image

Step 2 : Copy the images to be scanned inside image directory

Step 3 : Run script.py

python3 script.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
script.py		script.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OCR Text Extraction Script

Requirements

Installation

Step 1: Install Tesseract OCR

On Arch Linux:

On Ubuntu/Debian

On macOS (Using homebrew)

Step 2: Install Python Libraries Using pip

Usage

Step 1: create an image directory

Step 2 : Copy the images to be scanned inside image directory

Step 3 : Run script.py

About

Uh oh!

Releases

Packages

Languages

DreadPirate07/image-text-reader

Folders and files

Latest commit

History

Repository files navigation

OCR Text Extraction Script

Requirements

Installation

Step 1: Install Tesseract OCR

On Arch Linux:

On Ubuntu/Debian

On macOS (Using homebrew)

Step 2: Install Python Libraries Using pip

Usage

Step 1: create an image directory

Step 2 : Copy the images to be scanned inside image directory

Step 3 : Run script.py

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages