mirror of https://git.hardenedbsd.org/hardenedbsd/HardenedBSD.git (synced 2025-01-01 00:18:15 +01:00)
Describe collation order rules (from POSIX WG)
parent d2b945efae
commit 1b91f0ec7f
35
usr.bin/colldef/NOTE
Normal file
@@ -0,0 +1,35 @@
% Base collation scheme: 1994-03-22

% Ordering algorithm:
% 1. Spaces and hyphen (but not soft
% hyphen) before punctuation
% characters, punctuation characters
% before numbers,
% numbers before letters.
% 2. Letters with diacritical marks are
% members of equivalence classes
% 3. A distinction is made with regards
% to case as noted below.
% 4. Special characters are ignored
% when comparing letters, but then
% they are considered
% 5. The alphabets are sorted in order
% of first appearance in ISO 10646:
% Latin, Greek, Cyrillic, etc.
%
% According to Greek specifications,
% the steps 2. and 3. above are reversed
% for the Greek script
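
The ordering rules above can be read as a multi-level comparison. The following is a minimal Python sketch of that reading, not the colldef implementation. The names `char_class`, `script_rank` and `collation_key` are hypothetical. It assumes that capitals sort before small letters (the note only says case is distinguished), that the soft hyphen falls into the punctuation class, and that special characters act as a final tie-breaker. The Greek reversal of steps 2 and 3 is not modelled.

```python
import unicodedata

SCRIPTS = ("LATIN", "GREEK", "CYRILLIC")  # rule 5: order of appearance in ISO 10646


def char_class(ch):
    """Rule 1: space/hyphen < punctuation < numbers < letters."""
    if ch in " -":
        return 0
    if ch.isdigit():
        return 2
    if ch.isalpha():
        return 3
    return 1  # everything else, soft hyphen included (assumption)


def script_rank(ch):
    """Rule 5: rank a character by the script named in its Unicode name."""
    name = unicodedata.name(ch, "")
    for rank, script in enumerate(SCRIPTS):
        if name.startswith(script):
            return rank
    return len(SCRIPTS)


def collation_key(text):
    """Build a four-level key: base characters, diacritics, case, specials."""
    primary, diacritics, case, specials = [], [], [], []
    for pos, ch in enumerate(text):
        cls = char_class(ch)
        if cls in (0, 1):
            # Rule 4: ignored on the first pass, considered last.
            specials.append((pos, cls, ch))
            continue
        decomposed = unicodedata.normalize("NFD", ch)
        base = decomposed[0]
        primary.append((cls, script_rank(base), base.lower()))
        diacritics.append(decomposed[1:])        # rule 2: equivalence class
        case.append(0 if base.isupper() else 1)  # rule 3: capital first (assumption)
    return (primary, diacritics, case, specials)


words = ["b", "a", "1", "á", "A", "β", "co-op", "coop", "cop"]
print(sorted(words, key=collation_key))
```

In this sketch "co-op" and "coop" compare equal on letters and are separated only by the hyphen at the last level, which is one way to interpret rule 4.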

% collating symbols

collating-symbol <CAPITAL>
collating-symbol <CAPITAL-SMALL>
collating-symbol <SMALL-CAPITAL>
collating-symbol <SMALL>

% <CAPITAL-SMALL> and <SMALL-CAPITAL>
% are for cases like Danish <A><a>
% and Spanish <C><h> being treated
% as one letter.
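
To illustrate why four case symbols are needed once a two-character sequence counts as one letter, here is a small Python sketch. It is an illustration only: the `DIGRAPHS` set and the functions `case_symbol` and `collating_elements` are hypothetical, and which sequences count as one letter depends on the locale in question.

```python
# Hypothetical set of two-character sequences treated as a single letter
# (Danish "aa", Spanish "ch"); a real locale defines its own.
DIGRAPHS = {"aa", "ch"}


def case_symbol(element):
    """Map a one- or two-character collating element to a case symbol."""
    if len(element) == 1:
        return "CAPITAL" if element.isupper() else "SMALL"
    first_upper = element[0].isupper()
    second_upper = element[1].isupper()
    if first_upper and second_upper:
        return "CAPITAL"
    if first_upper:
        return "CAPITAL-SMALL"   # e.g. "Ch", "Aa"
    if second_upper:
        return "SMALL-CAPITAL"   # e.g. "cH", "aA"
    return "SMALL"


def collating_elements(word):
    """Split a word into collating elements, pairing each with its case symbol."""
    elements = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair.lower() in DIGRAPHS:
            elements.append((pair, case_symbol(pair)))
            i += 2
        else:
            elements.append((word[i], case_symbol(word[i])))
            i += 1
    return elements


print(collating_elements("Chile"))
print(collating_elements("AAlborg"))
```

A single letter only needs `<CAPITAL>` and `<SMALL>`; the mixed symbols appear only for two-character elements such as "Ch".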