Skip to content
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
11 changes: 3 additions & 8 deletions imsc1/spec/ttml-ww-profiles.html
Original file line number Diff line number Diff line change
Expand Up @@ -128,11 +128,6 @@
publisher: "ARIB",
href:"https://www.arib.or.jp/english/std_tr/broadcasting/std-b62.html",
title: "STD-B62, Multimedia Coding Specification For Digital Broadcasting (Second Generation), Version 2.2 (Fascicle 1)"
},
"ISO10646": {
publisher: "International Organization for Standardization",
href: "https://www.iso.org/standard/76835.html",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removing this means there's no link at all to this reference, and that it is the only reference in the spec without a link.

That seems like a non-ideal result. Specref doesn't have any link to 10646 that I can find. @himorin do you know if there's any other way to reference this important document? I wonder if i18n has a guide for how to reference it, for example.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removing this means there's no link at all to this reference, and that it is the only reference in the spec without a link.

Do you mean without URL?

ISO does not offer URLs to the latest edition of the standard.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As pointed during call (sorry Ive missed at that period), I also could not find any canonical pointer to the newest version of 10646. ICS does not work on this front nor search box does not change URL.
Also for character collection we refer for ja subset, I could not find any pointer within Unicode one. I haven't checked entire UTS or else, but seems missIng in Unicode. (or may have some replacement by General Category or something??)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For other specs, I suppose most of text refer Unicode but not 10646, which seems not in xref neither(??)...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also for character collection we refer for ja subset, I could not find any pointer within Unicode one. I haven't checked entire UTS or else, but seems missIng in Unicode.

I think you might be right. See https://www.unicode.org/versions/Unicode17.0.0/core-spec/appendix-c/

Few applications are expected to make use of all of the characters defined in ISO/IEC 10646. The conformance clauses of the two standards address this situation in very different ways. ISO/IEC 10646 provides a mechanism for specifying included subsets of the character repertoire, permitting implementations to ignore characters that are not included (see normative Annex A of ISO/IEC 10646). A Unicode implementation requires a minimal level of handling all character codes—namely, the ability to store and retransmit them undamaged. Thus the Unicode Standard encompasses the entire ISO/IEC 10646 repertoire without requiring that any particular subset be implemented.

So maybe we can have a dated reference to a specific edition of ISO/IEC 10646 strictly for the Japanese collections.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

downloaded ISO PDF and checked about collection.
Collections are defined in normative Annex A Collections of graphic characters for subsets, and defined collections 301 to 321 represent whole UCS, like 303 for Unicode 3.1 (detailed in Annex A.3). So, I suppose this will not be in Unicode side, but just in ISO. CJK collections, from 370, are defined in Annex A.4 subsections and refer source reference file for CJK Unified Ideographs (CJKSrc). Collections from 370 to 375 (and 380-389) are marked as fixed collection, so these will not be changed by newer edition of ISO 10646 (p2751 of 10646:2000).

So, in total, I'd agree to use dated reference.

title: " ISO/IEC 10646:2020 Information technology — Universal coded character set (UCS)/"
}
}
};
Expand Down Expand Up @@ -3627,11 +3622,11 @@ <h2>Common Character Sets</h2>

<td>
(Basic Japanese Collection)<br>
Collection 285 at [[ISO10646]]<br>
Collection 285 at [[UNICODE]]<br>
(Japanese Non Ideographic Extension)<br>
Collection 286 at [[ISO10646]]<br>
Collection 286 at [[UNICODE]]<br>
(JIS2004 Ideographics Extension)<br>
Collection 371 at [[ISO10646]]<br>
Collection 371 at [[UNICODE]]<br>
(Fullwidth ASCII variants)<br>
U+FF01 – U+FF5E<br>
(Fullwidth Symbol variants)<br>
Expand Down