Text_Extract

Text extraction form Images, OCR, Tesseract, Basic Image manipulation are all important yet very basic scripts.

This script uses pytesseract for text extraction from images, considering it only recognizes text and can only print it, this script additionally adds a functionality to write the text in a txt and/or csv file.

Setup instructions

Setup a python 3.x virtual environment.
Activate the environment
Install the dependencies using pip3 install -r requirements.txt
You are all set and the script is Ready to run.
Carefully follow the Instructions.

Usage

Just make sure that Tesseract is in proper directory, run the code according the comments and guidelines.

Smaple -
Enter the Folder name containing Images: <Name of Folder>
Enter your desired output location: <Name of Folder>

Output

Image containing Text

Before Compression

After Extraction

After Backup

Author(s)

Made by Vybhav Chaturvedi

AI Accelerated Quality -PRO