Skip to content

Commit 552c708

Browse files
committed
mktables: Handle Unicode 16.0 DoNotEmit.txt
We handle it by ignoring this file, new to Unicode 16.0. It consists of lists of characters that, to put it less delicately than Unicode would like, they regret creating. But there are no rules associated with them. It would be nice to have a \p{DoNotEmit} property so that applications could handle situations where this occurs. But I'm fearful that if we did something like this, that Unicode would later come up with something that had the same intention but would be subtly or unsubtly different. That has happened before, to our detriment. So I think we should wait to see what they do do, in future releases.
1 parent 397e387 commit 552c708

6 files changed

+11
-5
lines changed

charclass_invlists.inc

+1-1
Original file line numberDiff line numberDiff line change
@@ -436055,7 +436055,7 @@ static const U8 WB_table[23][23] = {
436055436055
* 3f4f32ed2a577344a508114527e721d7a8b633d32f38945d47fe0c743650c585 lib/unicore/extracted/DLineBreak.txt
436056436056
* 710abf2d581ac9c57f244c0834f9d9969d9781e0396adccd330eaae658ac7d6b lib/unicore/extracted/DNumType.txt
436057436057
* 6bd30f385f3baf3ab5d5308c111a81de87bea5f494ba0ba69e8ab45263b8c34d lib/unicore/extracted/DNumValues.txt
436058-
* a1534d11ec6bef256d1829e9fd74c2202ed7f1475ae3c7659d2566e92c7e0f02 lib/unicore/mktables
436058+
* 2851ec4057abad0019e802bee35d17b13a95b75f7b72651edd27c6e31d527fac lib/unicore/mktables
436059436059
* 55d90fdc3f902e5c0b16b3378f9eaa36e970a1c09723c33de7d47d0370044012 lib/unicore/version
436060436060
* 0a6b5ab33bb1026531f816efe81aea1a8ffcd34a27cbea37dd6a70a63d73c844 regen/charset_translations.pl
436061436061
* c7ff8e0d207d3538c7feb4a1a152b159e5e902d20293b303569ea8323e84633e regen/mk_PL_charclass.pl

lib/unicore/mktables

+6
Original file line numberDiff line numberDiff line change
@@ -19884,6 +19884,12 @@ my @input_file_objects = (
1988419884
Skip => $Unused_Skip,
1988519885
UCD => 0,
1988619886
),
19887+
Input_file->new('DoNotEmit.txt', v16.0.0,
19888+
# Advice about characters that are unwise to create; not
19889+
# any properties, though we could create some.
19890+
Skip => $Unused_Skip,
19891+
UCD => 0,
19892+
),
1988719893
);
1988819894

1988919895
# End of all the preliminaries.

lib/unicore/uni_keywords.pl

+1-1
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

regcharclass.h

+1-1
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

regexp_constants.h

+1-1
Original file line numberDiff line numberDiff line change
@@ -78,7 +78,7 @@
7878
* 3f4f32ed2a577344a508114527e721d7a8b633d32f38945d47fe0c743650c585 lib/unicore/extracted/DLineBreak.txt
7979
* 710abf2d581ac9c57f244c0834f9d9969d9781e0396adccd330eaae658ac7d6b lib/unicore/extracted/DNumType.txt
8080
* 6bd30f385f3baf3ab5d5308c111a81de87bea5f494ba0ba69e8ab45263b8c34d lib/unicore/extracted/DNumValues.txt
81-
* a1534d11ec6bef256d1829e9fd74c2202ed7f1475ae3c7659d2566e92c7e0f02 lib/unicore/mktables
81+
* 2851ec4057abad0019e802bee35d17b13a95b75f7b72651edd27c6e31d527fac lib/unicore/mktables
8282
* 55d90fdc3f902e5c0b16b3378f9eaa36e970a1c09723c33de7d47d0370044012 lib/unicore/version
8383
* 0a6b5ab33bb1026531f816efe81aea1a8ffcd34a27cbea37dd6a70a63d73c844 regen/charset_translations.pl
8484
* c7ff8e0d207d3538c7feb4a1a152b159e5e902d20293b303569ea8323e84633e regen/mk_PL_charclass.pl

uni_keywords.h

+1-1
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

0 commit comments

Comments
 (0)