PP-OCRv3 Text Detection

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System.

Note:

The int8-quantized models may produce unstable results because quantization loses some accuracy.
Source of the original Paddle model for English: here.
Source of the original Paddle model for Chinese: here.
IC15 in the filename means the model is trained on the IC15 dataset and detects English text instances only.
TD500 in the filename means the model is trained on the TD500 dataset and detects both English and Chinese text instances.
Visit https://docs.opencv.org/master/d4/d43/tutorial_dnn_text_spotting.html for more information.
text_detection_xx_ppocrv3_2023may_int8bq.onnx is the block-quantized int8 version of the model. It is generated using block_quantize.py with block_size=64.
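
To give a rough idea of what block quantization with block_size=64 means, here is a minimal sketch in plain Python. It is not the code in block_quantize.py: the function names are hypothetical, and the symmetric one-scale-per-block scheme is an assumption chosen for brevity (the real script may differ, for example by also storing zero points).

```python
from typing import List, Tuple


def block_quantize(weights: List[float], block_size: int = 64) -> Tuple[List[int], List[float]]:
    """Quantize a flat list of weights to int8, using one scale per block.

    Hypothetical helper for illustration; assumes symmetric quantization.
    """
    quantized: List[int] = []
    scales: List[float] = []
    for start in range(0, len(weights), block_size):
        block = weights[start:start + block_size]
        max_abs = max(abs(w) for w in block)
        # Map the largest magnitude in this block to 127; avoid dividing by zero.
        scale = max_abs / 127.0 if max_abs > 0 else 1.0
        scales.append(scale)
        quantized.extend(max(-127, min(127, round(w / scale))) for w in block)
    return quantized, scales


def block_dequantize(quantized: List[int], scales: List[float], block_size: int = 64) -> List[float]:
    """Recover approximate float weights from int8 values and per-block scales."""
    return [q * scales[i // block_size] for i, q in enumerate(quantized)]
```

Because each block of 64 values gets its own scale, a block of small weights is not forced to share the coarse step size needed by a block of large weights, which is the usual motivation for quantizing per block rather than per tensor.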

Demo

Python

Run the following commands to try the demo:

# detect on camera input
python demo.py
# detect on an image
python demo.py --input /path/to/image -v

# get help regarding various parameters
python demo.py --help
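
For use outside the demo script, the following is a minimal sketch of running one of the models through OpenCV's `cv.dnn.TextDetectionModel_DB` API. It is an illustration under assumptions, not the contents of demo.py or ppocr_det.py: the helper names `model_filename` and `detect_text` are hypothetical, and the input size, thresholds, and mean values are assumed defaults that may need tuning for your images.

```python
from typing import Tuple


def model_filename(lang: str = "en", variant: str = "fp32") -> str:
    """Build a model filename following the naming pattern used in this directory.

    Hypothetical helper: lang is "en" or "cn"; variant is "fp32", "int8", or "int8bq".
    """
    suffixes = {"fp32": "", "int8": "_int8", "int8bq": "_int8bq"}
    if lang not in ("en", "cn"):
        raise ValueError(f"unsupported language: {lang}")
    if variant not in suffixes:
        raise ValueError(f"unsupported variant: {variant}")
    return f"text_detection_{lang}_ppocrv3_2023may{suffixes[variant]}.onnx"


def detect_text(image_path: str, model_path: str, input_size: Tuple[int, int] = (736, 736)):
    """Detect text regions in an image; returns (boxes, confidences)."""
    # Imported here so the filename helper above works without OpenCV installed.
    import cv2 as cv

    image = cv.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    image = cv.resize(image, input_size)  # input_size is (width, height)

    detector = cv.dnn.TextDetectionModel_DB(model_path)
    # Assumed post-processing settings; adjust for your data.
    detector.setBinaryThreshold(0.3)
    detector.setPolygonThreshold(0.5)
    detector.setMaxCandidates(200)
    detector.setUnclipRatio(2.0)
    # Assumed preprocessing: scale to [0, 1] after subtracting per-channel means.
    detector.setInputParams(1.0 / 255.0, input_size, (122.67891434, 116.66876762, 104.00698793))
    return detector.detect(image)
```

A call such as `detect_text("/path/to/image", model_filename("en", "int8bq"))` would return box coordinates relative to the resized image, so they need to be scaled back if you want to draw them on the original.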

C++

Install the latest OpenCV and CMake >= 3.24.0 to get started:

# A typical and default installation path of OpenCV is /usr/local
cmake -B build -D OPENCV_INSTALLATION_PATH=/path/to/opencv/installation .
cmake --build build
# detect on camera input
./build/opencv_zoo_text_detection_ppocr -m=/path/to/model
# detect on an image
./build/opencv_zoo_text_detection_ppocr -m=/path/to/model -i=/path/to/image -v
# get help messages
./build/opencv_zoo_text_detection_ppocr -h

Example outputs

License

All files in this directory are licensed under the Apache 2.0 License.
