From c412d8393a9bc156a3d9f58193b2ca389526a747 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Jo=C3=A3o=20Paulo=20Abdala=20Bohaczk?= <114778253+joaopabdala@users.noreply.github.com> Date: Tue, 22 Apr 2025 16:06:51 -0300 Subject: [PATCH 1/2] Update text_detection_theory.ipynb "Different from the task of target detection, target detection" to "Different from the task of text detection, target detection " --- notebook_en/2.text_detection/text_detection_theory.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/notebook_en/2.text_detection/text_detection_theory.ipynb b/notebook_en/2.text_detection/text_detection_theory.ipynb index 5916ebe..c3093ad 100644 --- a/notebook_en/2.text_detection/text_detection_theory.ipynb +++ b/notebook_en/2.text_detection/text_detection_theory.ipynb @@ -13,7 +13,7 @@ "source": [ "## 1 Text Detection\n", "\n", - "The task of text detection is to find out the position of text in an image or video. Different from the task of target detection, target detection must not only solve the positioning problem, but also solve the problem of target classification.\n", + "The task of text detection is to find out the position of text in an image or video. Different from the task of text detection, target detection must not only solve the positioning problem, but also solve the problem of target classification.\n", "\n", "The manifestation of text in images can be regarded as a kind of 'target', and general target detection methods are also suitable for text detection. From the perspective of the task itself:\n", "\n", From d8d4e6784cc8dd42f612cc597f3f1c3fc3018cb2 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Jo=C3=A3o=20Paulo=20Abdala=20Bohaczk?= <114778253+joaopabdala@users.noreply.github.com> Date: Tue, 22 Apr 2025 16:34:20 -0300 Subject: [PATCH 2/2] Update text_detection_theory.ipynb --- notebook_en/2.text_detection/text_detection_theory.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/notebook_en/2.text_detection/text_detection_theory.ipynb b/notebook_en/2.text_detection/text_detection_theory.ipynb index c3093ad..335f655 100644 --- a/notebook_en/2.text_detection/text_detection_theory.ipynb +++ b/notebook_en/2.text_detection/text_detection_theory.ipynb @@ -195,7 +195,7 @@ "\n", "Although the segmentation method solves the problem of curved text detection, complex post-processing logic and prediction speed are also goals that need to be optimized.\n", "\n", - "PAN [11] aims at the problem of slow text detection and prediction speed, and improves the performance of the algorithm from the aspects of network design and post-processing. First, PAN uses the lightweight ResNet18 as the Backbone, and also designs the lightweight feature enhancement module FPEM and feature fusion module FFM to enhance the features extracted by the Backbone. In terms of post-processing, a pixel clustering method is used to merge pixels whose distance from the kernel is less than the threshold d along the predicted text center (kernel). PAN guarantees high accuracy while having faster prediction speed.\n", + "PAN [11] aims at the problem of slow text detection and prediction speed, and improves the performance of the algorithm from the aspects of network design and post-processing. First, PAN uses the lightweight ResNet18 as the Backbone, and also designs the lightweight feature enhancement module FPEM and feature fusion module FFM to enhance the features extracted by the Backbone. In terms of post-processing, a pixel clustering method is used to merge pixels whose distance from the kernel is less than the threshold along the predicted text center (kernel). PAN guarantees high accuracy while having faster prediction speed.\n", "\n", "\n", "