From: Wolfram Schneider
Date: Tue, 18 Nov 2008 22:45:34 +0000 (+0100)
Subject: cleanup & tidy
X-Git-Tag: v3.0.40~38
X-Git-Url: http://jsfdemo.indexdata.com/cgi-bin?a=commitdiff_plain;h=2e9b1a9f5eb9aa8524241772704e9828bd0de420;p=yaz-moved-to-github.git

cleanup & tidy
---

diff --git a/src/codetables-iso5426.xml b/src/codetables-iso5426.xml
index d61546f..7d1bdb1 100644
--- a/src/codetables-iso5426.xml
+++ b/src/codetables-iso5426.xml
@@ -7,21 +7,13 @@
  contains the ISO5426 code (in hex) for the character as coming from the G1 graphic set, the third column contains the UCS/Unicode 16-bit code (in hex), the fourth column contains the UTF-8 code (in hex) for the UCS
- characters, the fifth column contains a representation of the character (where possible),
+ characters, the fifth column contains a representation of the character (where possible),
  the sixth column contains the MARC character name, followed by the UCS name. If the MARC name is the same as or very similar to the
- UCS name, only the UCS name is given. For some tables alternate encodings
- in Unicode and UTF-8 are given. When that occurs the alternate Unicode and
+ UCS name, only the UCS name is given. For some tables alternate encodings
+ in Unicode and UTF-8 are given. When that occurs the alternate Unicode and
  alternate UTF-8 columns follow the character name.
-
  1D 001D
@@ -615,7 +607,6 @@ BRACKET
  SPACING TILDE / TILDE
-
  See also Zeichentabelle MAB2 (ISO 5426-1983), http://www.gymel.com/charsets/MAB2.html
@@ -641,14 +632,12 @@ BRACKET
  C2A1 INVERTED EXCLAMATION MARK
-
  A2 201E E2809E LOW DOUBLE COMMA QUOTATION MARK
-
  A3 00A3
@@ -661,19 +650,18 @@ BRACKET
  24 DOLLAR SIGN
-
  A5 00A5 C2A5 YEN SIGN
-
+
  A6 2020 E280A0 DAGGER
-
+
  A7 00A7
@@ -686,31 +674,30 @@ BRACKET
  E280A0 PRIME
-
  A9 2018 E28098 SINGLE TURNED COMMA QUOTATION MARK
-
+
  AA 201C E2809C DOUBLE TURNED COMMA QUOTATION MARK
-
+
  AB 00AB E280A0 LEFT-POINTING DOUBLE ANGLE QUOTATION MARK (LEFT POINTING GUILLEMET)
-
+
  AC 266D E299AD MUSIC FLAT SIGN (FLAT)
-
+
  AD 00A9
@@ -729,17 +716,12 @@ BRACKET
  C2AE PATENT MARK / REGISTERED SIGN
-
-
-
-
  B0 02BB CABB AYN / MODIFIER LETTER TURNED COMMA
-
  B1 02BC
@@ -747,7 +729,6 @@ BRACKET
  CABE ALIF / MODIFIER LETTER APOSTROPHE
-
  B2 201A
@@ -772,26 +753,26 @@ BRACKET
  2033 E280B3 DOUBLE PRIME
-
+
  B9 2019 E2809D RIGHT SINGLE QUOTATION MARK (SINGLE COMMA QUOTATION MARK)
-
+
  BA 201D E2809D RIGHT DOUBLE QUOTATION MARK (DOUBLE COMMA QUOTATION MARK)
-
+
  BB 00BB C2BB RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK (RIGHT POINTING GUILLEMET)
-
-
+
+
  BC 266F E299AF
@@ -807,7 +788,7 @@ BRACKET
  BE 02BA CABA
- HARD SIGN, DOUBLE PRIME / MODIFIER LETTER DOUBLE PRIME
+ HARD SIGN, DOUBLE PRIME / MODIFIER LETTER DOUBLE PRIME
  BF
@@ -815,7 +796,6 @@ BRACKET
  C2BF INVERTED QUESTION MARK
-
  true C0
@@ -908,7 +888,7 @@ BRACKET
  CC 0313 CC93
- HIGH COMMA, CENTERED / COMBINING COMMA ABOVE (Psili)
+ HIGH COMMA, CENTERED / COMBINING COMMA ABOVE (Psili)
  true
@@ -931,14 +911,13 @@ BRACKET
  CC8C HACEK / COMBINING CARON
-
  true D0 0327 CCA7 CEDILLA / COMBINING CEDILLA
-
+
  true D1
@@ -952,7 +931,7 @@ BRACKET
  0326 CCA6 LEFT HOOK (COMMA BELOW) / COMBINING COMMA BELOW
-
+
  true D3
@@ -1002,7 +981,6 @@ BRACKET
  CCB3 DOUBLE UNDERSCORE / COMBINING DOUBLE LOW LINE
-
  true DA
@@ -1026,7 +1004,7 @@ BRACKET
  FE22 EFB8A2 DOUBLE TILDE, FIRST HALF / COMBINING DOUBLE TILDE
-
+
  true DE
@@ -1035,18 +1013,18 @@ BRACKET
  FE21 EFB8A1 LIGATURE, SECOND HALF / COMBINING LIGATURE RIGHT HALF
- The Ligature that spans two characters
- is constructed of two halves in MARC-8: EB
- (Ligature, first half) and EC (Ligature, second
- half). The preferred Unicode/UTF-8 mapping is to
+ The Ligature that spans two characters
+ is constructed of two halves in MARC-8: EB
+ (Ligature, first half) and EC (Ligature, second
+ half). The preferred Unicode/UTF-8 mapping is to
  the single character Ligature that spans two characters, U+0361. The single character Ligature is encoded
- following the second of the two characters to be spanned.
- The two half Ligatures in Unicode, to which the
- Ligature has been mapped since 1996, are indicted
- in the mapping as alternatives, but their use is not
- recommended. It is expected that font support for
- the single character Ligature mark will be more
+ following the second of the two characters to be spanned.
+ The two half Ligatures in Unicode, to which the
+ Ligature has been mapped since 1996, are indicted
+ in the mapping as alternatives, but their use is not
+ recommended. It is expected that font support for
+ the single character Ligature mark will be more
  easily obtained than for the two halves.
@@ -1057,24 +1035,22 @@ BRACKET
  FE23 EFB8A3 DOUBLE TILDE, SECOND HALF / COMBINING DOUBLE TILDE RIGHT HALF
- The Double Tilde that spans two characters is
- constructed of two halves in MARC-8: FA (Double
- Tilde, first half) and FB (Double Tilde, second
- half). The preferred Unicode/UTF-8 mapping
- is to the single character Double Tilde that
- spans two characters, U+0360. The single
- character Double Tilde is encoded following
- the second of the two characters to be spanned.
- The two half Double Tildes in Unicode, to
- which the MARC8 Double Tilde has been
- mapped since 1996, are indicted in the
- mapping as alternatives, but their use is not
- recommended. It is expected that font support
- for the single character Double Tilde mark will
+ The Double Tilde that spans two characters is
+ constructed of two halves in MARC-8: FA (Double
+ Tilde, first half) and FB (Double Tilde, second
+ half). The preferred Unicode/UTF-8 mapping
+ is to the single character Double Tilde that
+ spans two characters, U+0360. The single
+ character Double Tilde is encoded following
+ the second of the two characters to be spanned.
+ The two half Double Tildes in Unicode, to
+ which the MARC8 Double Tilde has been
+ mapped since 1996, are indicted in the
+ mapping as alternatives, but their use is not
+ recommended. It is expected that font support
+ for the single character Double Tilde mark will
  be more easily obtained than for the two halves.
-
-
  E1
@@ -1100,7 +1076,7 @@ BRACKET
  E8 0141 C581
- UPPERCASE POLISH L / LATIN CAPITAL LETTER L WITH STROKE
+ UPPERCASE POLISH L / LATIN CAPITAL LETTER L WITH STROKE
  E9
@@ -1152,7 +1128,7 @@ BRACKET
  0133 C4B3 LATIN SMALL LIGATURE IJ (LATIN SMALL LETTER I J)
-
+
  F8