This Python application allows users to perform Optical Character Recognition (OCR) on a selected region of the screen or an imported image. The GUI is built using the CustomTkinter library and supports various image preprocessing options to enhance OCR accuracy.
- Select a region of the screen to capture and perform OCR
- Import an image and perform OCR
- Preprocess images to improve OCR accuracy
- Display input and output images
- Adjustable settings, including language, preprocessing parameters, and shortcuts
- Python 3.6 or higher
- pytesseract
- Pillow (PIL)
- OpenCV (cv2)
- numpy
- pyautogui
- pyperclip
- customtkinter
- Clone the repository:
git clone https://github.com/korinz/screen_region_ocr_gui.git
- Change to the project directory:
cd screen_region_ocr_gui
- Install the required dependencies:
pip install -r requirements.txt
- Set the path to the Tesseract executable in
main.py
.
- Run the main script:
python gui.py
-
Use the GUI to take a screenshot, import an image, and perform OCR on the selected region or imported image.
-
Adjust settings and preprocessing parameters as needed.
-
View the input and output images in the GUI.
-
The OCR result will be printed in the terminal.
This project is licensed under the MIT License.