Skip to content

A snipping tool to Extract Text By snipping using cursor from anywhere on the screen.

Notifications You must be signed in to change notification settings

JHM69/ocr-snipping-tool

Folders and files

NameName
Last commit message
Last commit date

Latest commit

e69ef7a · Dec 28, 2024

History

24 Commits
Dec 13, 2024
Dec 13, 2024
Dec 28, 2024
Dec 13, 2024
Dec 13, 2024
Dec 13, 2024
Dec 13, 2024
Dec 13, 2024
Dec 13, 2024
Dec 13, 2024
Dec 13, 2024
Dec 13, 2024
Dec 13, 2024

Repository files navigation

OCR Snipping Tool

A snipping tool to Extract Text By snipping using cursor from anywhere on the screen.

Demo Demo

Home Home

Setting Setting

Setup Guide

Requirements

  • Python 3.x
  • PyQt5
  • Pillow
  • OpenCV
  • NumPy
  • pytesseract
  • pyperclip3

Installation

  1. Clone the repository:

    git clone https://github.com/JHM69/ocr-snipping-tool.git
    cd ocr-snipping-tool
  2. Install the required packages:

    pip install -r requirements.txt
  3. Tesseract Setup:

    • Windows:
      1. Download the Tesseract installer from Tesseract at UB Mannheim.
      2. Install Tesseract and note the installation path (e.g., C:\Program Files\Tesseract-OCR\tesseract.exe).
    • Linux:
      1. Install Tesseract using the package manager:
        sudo apt install tesseract-ocr
  4. Run the application:

    • Launch the application by running:
    python main.py
  5. Set API Key and Tesseract Path:

    • Once the application is running, you can enter the Tesseract path and Gemini API key in the provided input fields in the UI.
    • Click the "Save" button to store these settings.
  6. Using the Tool:

    • Press Ctrl + N to create a new snip.
    • Press Q to quit the application.

OCR Engine and Language Settings

  • You can select between Tesseract and Gemini OCR engines from the dropdown menu in the application.
  • You can also select the language for OCR from the settings tab.
  • Supported languages include English, Bangla, Hindi, Japanese, Spanish, French, German, Chinese (Simplified), Russian, and Arabic.