Skip to content

Files

Latest commit

25f423d · Nov 6, 2024

History

History
This branch is 2 commits behind opencv/opencv_zoo:main.

text_detection_ppocr

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
Dec 23, 2023
Jun 4, 2024
Dec 23, 2023
Nov 6, 2024
Feb 29, 2024
Jun 4, 2024
Dec 23, 2023
Dec 23, 2023
Dec 23, 2023
Nov 6, 2024
Dec 23, 2023
Dec 23, 2023
Nov 6, 2024

PP-OCRv3 Text Detection

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System.

Note:

  • The int8 quantization model may produce unstable results due to some loss of accuracy.
  • Original Paddle Models source of English: here.
  • Original Paddle Models source of Chinese: here.
  • IC15 in the filename means the model is trained on IC15 dataset, which can detect English text instances only.
  • TD500 in the filename means the model is trained on TD500 dataset, which can detect both English & Chinese instances.
  • Visit https://docs.opencv.org/master/d4/d43/tutorial_dnn_text_spotting.html for more information.
  • text_detection_xx_ppocrv3_2023may_int8bq.onnx represents the block-quantized version in int8 precision and is generated using block_quantize.py with block_size=64.

Demo

Python

Run the following command to try the demo:

# detect on camera input
python demo.py
# detect on an image
python demo.py --input /path/to/image -v

# get help regarding various parameters
python demo.py --help

C++

Install latest OpenCV and CMake >= 3.24.0 to get started with:

# A typical and default installation path of OpenCV is /usr/local
cmake -B build -D OPENCV_INSTALLATION_PATH=/path/to/opencv/installation .
cmake --build build
# detect on camera input
./build/opencv_zoo_text_detection_ppocr -m=/path/to/model
# detect on an image
./build/opencv_zoo_text_detection_ppocr -m=/path/to/model -i=/path/to/image -v
# get help messages
./build/opencv_zoo_text_detection_ppocr -h

Example outputs

mask

gsoc

License

All files in this directory are licensed under Apache 2.0 License.

Reference