Commit aa6faba

mktables: Handle Unicode 16.0 Unikemet.txt file
This is handled by ignoring it for now, and letting mktables know that the properties it contains are empty.

This file, new in 16.0, gives extra information about the Egyptian hieroglyphs newly encoded in 16.0. It is intended only for scholars of these ancient symbols. mktables normally handles new properties automatically, but this file is in a completely different format from previous ones, so mktables would have to be adapted to understand it. That might not be too hard, given that mktables has infrastructure to handle other outliers that have come along over the years from Unicode.

But, by ignoring this file, we create empty tables, which generate errors in other places in perl. These are real bugs that ought to be fixed, and will be before 16.0 is incorporated into blead.

And how many Egyptologists are there in the world, much less how many use the latest Perl? So the perldelta will say that 16.0's support doesn't include these properties, which are mostly provisional anyway.
1 parent b30f4cc commit aa6faba

6 files changed, +26 -5 lines changed

charclass_invlists.inc

+1 -1
@@ -436055,7 +436055,7 @@ static const U8 WB_table[23][23] = {
  * 3f4f32ed2a577344a508114527e721d7a8b633d32f38945d47fe0c743650c585 lib/unicore/extracted/DLineBreak.txt
  * 710abf2d581ac9c57f244c0834f9d9969d9781e0396adccd330eaae658ac7d6b lib/unicore/extracted/DNumType.txt
  * 6bd30f385f3baf3ab5d5308c111a81de87bea5f494ba0ba69e8ab45263b8c34d lib/unicore/extracted/DNumValues.txt
- * a1534d11ec6bef256d1829e9fd74c2202ed7f1475ae3c7659d2566e92c7e0f02 lib/unicore/mktables
+ * 803999c789a4780e22aa278e3dfa95162e48a0d329aa993cda26b99116672651 lib/unicore/mktables
  * 55d90fdc3f902e5c0b16b3378f9eaa36e970a1c09723c33de7d47d0370044012 lib/unicore/version
  * 0a6b5ab33bb1026531f816efe81aea1a8ffcd34a27cbea37dd6a70a63d73c844 regen/charset_translations.pl
  * c7ff8e0d207d3538c7feb4a1a152b159e5e902d20293b303569ea8323e84633e regen/mk_PL_charclass.pl

lib/unicore/mktables

+21
@@ -871,6 +871,15 @@ push @tables_that_may_be_empty, 'Grapheme_Cluster_Break=Prepend'
 push @tables_that_may_be_empty, 'Canonical_Combining_Class=CCC133'
                                                     if $v_version ge v6.2.0;
 
+# These properties of Egyptian hieroglyphs are not yet handled by Perl. Their
+# intended audience is only specialist Egyptologists
+push @tables_that_may_be_empty, qw(kEH_Cat kEH_Desc kEH_HG kEH_IFAO
+                                   kEH_JSesh
+                                   kEH_NoMirror kEH_NoMirror=Yes
+                                   kEH_NoMirror=No
+                                   kEH_NoRotate kEH_NoRotate=Yes)
+                                                    if $v_version ge v16.0.0;
+
 # The lists below are hashes, so the key is the item in the list, and the
 # value is the reason why it is in the list. This makes generation of
 # documentation easier.
@@ -19869,6 +19878,18 @@ my @input_file_objects = (
                     Skip => $Unused_Skip,
                     UCD => 0,
                    ),
+    Input_file->new('Unikemet.txt', v16.0.0,
+                    # For Egyptian Hieroglyphs; is in an alien format to the
+                    # other files Unicode furnishes.
+                    Skip => $Unused_Skip,
+                    UCD => 0,
+                   ),
+    Input_file->new('DoNotEmit.txt', v16.0.0,
+                    # Advice about characters that are unwise to create; not
+                    # any properties, though we could create some.
+                    Skip => $Unused_Skip,
+                    UCD => 0,
+                   ),
 );
 
 # End of all the preliminaries.
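
The version gate used in the mktables hunks above (`if $v_version ge v16.0.0`) can be tried in isolation. The following is a minimal sketch, not code from mktables: the two subroutines are hypothetical helpers invented for illustration, and only the property names and the `ge` comparison on v-strings are taken from the diff. It assumes that an empty table counts as an error unless its name is on the allow-list.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of mktables): returns the names of tables
# that are allowed to be empty when compiling the given Unicode version.
sub tables_that_may_be_empty {
    my ($v_version) = @_;
    my @allowed;

    # v-strings are compared character by character by the string
    # operators, so v15.1.0 sorts before v16.0.0.
    push @allowed, 'Canonical_Combining_Class=CCC133'
                                        if $v_version ge v6.2.0;
    push @allowed, qw(kEH_Cat kEH_Desc kEH_HG kEH_IFAO kEH_JSesh)
                                        if $v_version ge v16.0.0;
    return @allowed;
}

# Hypothetical check: given the names of tables that turned out empty,
# return those that are not on the allow-list for this version.
sub unexpected_empty_tables {
    my ($v_version, @empty) = @_;
    my %allowed = map { $_ => 1 } tables_that_may_be_empty($v_version);
    return grep { !$allowed{$_} } @empty;
}

for my $version (v15.1.0, v16.0.0) {
    my @bad = unexpected_empty_tables($version, 'kEH_Cat');
    printf "Unicode %vd: %s\n", $version,
           @bad ? "unexpected empty table(s): @bad"
                : "no unexpected empty tables";
}
```

In this sketch an empty `kEH_Cat` table is reported for 15.1.0 but accepted for 16.0.0, which mirrors the intent of the commit: the Unikemet properties exist from 16.0 onward but are deliberately left empty.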

lib/unicore/uni_keywords.pl

+1 -1 (generated file; diff not rendered)

regcharclass.h

+1 -1 (generated file; diff not rendered)

regexp_constants.h

+1 -1
@@ -78,7 +78,7 @@
  * 3f4f32ed2a577344a508114527e721d7a8b633d32f38945d47fe0c743650c585 lib/unicore/extracted/DLineBreak.txt
  * 710abf2d581ac9c57f244c0834f9d9969d9781e0396adccd330eaae658ac7d6b lib/unicore/extracted/DNumType.txt
  * 6bd30f385f3baf3ab5d5308c111a81de87bea5f494ba0ba69e8ab45263b8c34d lib/unicore/extracted/DNumValues.txt
- * a1534d11ec6bef256d1829e9fd74c2202ed7f1475ae3c7659d2566e92c7e0f02 lib/unicore/mktables
+ * 803999c789a4780e22aa278e3dfa95162e48a0d329aa993cda26b99116672651 lib/unicore/mktables
  * 55d90fdc3f902e5c0b16b3378f9eaa36e970a1c09723c33de7d47d0370044012 lib/unicore/version
  * 0a6b5ab33bb1026531f816efe81aea1a8ffcd34a27cbea37dd6a70a63d73c844 regen/charset_translations.pl
  * c7ff8e0d207d3538c7feb4a1a152b159e5e902d20293b303569ea8323e84633e regen/mk_PL_charclass.pl

uni_keywords.h

+1 -1 (generated file; diff not rendered)
