Lines Matching +full:0487 +full:a
30 This document describes a framework for preparing Unicode text
82 A. Unicode repertoires............................................23
83 A.1 Unassigned code points in Unicode 3.2.......................23
122 Similarly, a user can enter text into an application program in a
126 a consistent fashion.
128 This document specifies a framework of processing rules for Unicode
134 different input mechanisms, the strings should match on a character-
139 character sets, the transcoding algorithm is a critical part of
149 character can cause unpredictable display results, a profile can
150 specify that a string containing a backspace character would cause an
153 A profile of stringprep converts a single string of input characters
154 to a string of output characters, or returns an error if the output
155 string would contain a prohibited character. Stringprep profiles
156 cannot both emit a string and return an error.
159 might occur or that a user might expect. In particular, a profile
188 Note: A glossary of terms used in Unicode and ISO/IEC 10646 can be
194 [ISO10646]. For example, the letter "a" may be represented as either
195 "U+0061" or "LATIN SMALL LETTER A". In the lists of mappings and the
205 protocols. For example, a protocol that has strings that come from
207 specify that only strings that have been processed with a particular
208 profile of stringprep are legal. Another example would be a protocol
209 that does string comparison as a step in the protocol; that protocol
211 strings with a specific profile of stringprep.
218 When developers wish to allow users as wide of a range of characters
220 stringprep to convert characters from the input string to a canonical
235 A profile of stringprep can create tables different from those in the
240 A profile of stringprep MUST include all of the following:
264 will operate. Appendix A lists the Unicode repertoires that can be
269 repertories. Subsections of appendix A also list unassigned code
288 10FFFF for Unicode 3.2 in appendix A. The list in appendix A MUST be
290 discrepancies between the list in appendix A and the Unicode 3.2
291 specification, the list in appendix A always takes precedence.
299 In some cases, an existing profile can be reused by a different
306 1) Map -- For each character in the input, check if it has a mapping
332 receive a longer or shorter string than the one input in the
345 Each character in the input stream MUST be checked against a mapping
355 For any individual character, the mapping table MAY specify that a
357 mapped to a string of other characters.
360 is, if character A at position X is mapped to character B, character
416 If a profile is going to map characters for case-insensitive
423 upper-to-lower folding was chosen because there is a tradition of
426 If a profile creates its own mapping tables for case folding, they
435 need processing. These characters include a few Greek characters and
440 b = NormalizeWithKC(Fold(a));
442 if c is not the same as b, add a mapping for "a to c".
461 updating their profiles. Adding a new mapping for a currently-
462 assigned character, or changing an existing mapping, could cause a
469 the Unicode normalization forms, as described in [UAX15]. A profile
476 A profile MAY choose to do no normalization. However, such a profile
483 normalization orders those diacritics in a predictable fashion.
494 There is a third form of normalization, Unicode normalization with
495 form C. If a profile is going to use a Unicode normalization, it
511 A profile that specifies Unicode normalization MUST use the
515 The composition process described in [UAX15] requires a fixed
522 If a future version of the normalization tables changes the
525 update their normalization tables. Such a change could cause a
532 code points. There are a variety of prohibited code points, as
533 described in this section. A profile of this document MAY use all or
536 The stringprep process never emits both an error and a string. If an
554 It is important to note that a profile of this document MAY prohibit
567 Each subsection of this section has a matching subsection in appendix
594 200A; HAIR SPACE
631 206A-206F; [CONTROL CHARACTERS]
700 Although the replacement character (U+FFFD) might be used when a
704 example, on a computer with no Asian fonts, a string with three
713 inappropriate for strings that have to have a single canonical
735 202A; LEFT-TO-RIGHT EMBEDDING
740 206A; INHIBIT SYMMETRIC SWAPPING
764 A profile MAY choose to ignore bidirectional text. However, ignoring
773 The stringprep process never emits both an error and a string. If an
779 the requirements below, an "RandALCat character" is a character that
781 is a character that has Unicode bidirectional category "L". Note
800 2) If a string contains any RandALCat character, the string MUST NOT
803 3) If a string contains any RandALCat character, a RandALCat
804 character MUST be the first character of the string, and a
831 All code points not assigned in the character repertoire named in a
850 processing simpler and does not impose a "protocol" to handle
853 document is being written, it has happened approximately once a year.
854 Each time a new version of a repertoire appears, a new version of a
858 The list of unassigned code points MUST be given in a profile, and
864 string Y was prepared in a way that permits unassigned code points, a
866 that the strings don't match even though they would match if a more
868 and Y were prepared in a way that permits unassigned code points,
869 something worse can happen: even a positive result for the comparison
871 though they would not match if a more recent version of the profile
872 were used (one that prohibits a code point appearing in both X and
882 Each code point in a repertoire named by a profile of stringprep can
903 A subsequent version of a profile that references a newer version of
904 a repertoire with new code points will inherently have some code
906 compatibility, a subsequent version of a profile MUST NOT move code
908 points MUST NOT ever change to a different category.
911 latest version of a profile. That is, they are forbidden to contain
916 described by a profile of stringprep. Those applications MAY
917 optionally have a preprocessor that provide stricter checks: treating
919 about the fact that the code point is unassigned in the version of a
920 profile that the software is based on; such a choice is a local
925 Different software using different versions of a stringprep profile
932 The list below shows what happens if a query contains a code point
933 from category U that is allowed in a newer version of a profile. The
936 using version "oldVersion" of a profile, the stored string was
942 Because the application passed X through, it gets back a positive
944 where X is a combining mark.
947 combining mark Y has a lower combining class than X then XY will
959 stored string. However, no string can be stored with XY, so a
960 query with XY will get a negative answer to the test for matching.
964 exist in any stored string, so any query containing X will get a
971 get a negative answer to the test for matching.
973 In none of the cases does the query get data for a stored string
976 Profiles are stable between versions in the following sense: If a
983 what happens when an application uses a newer or older version of a
986 Newer query application -- Suppose that a querying application is
990 because the new profile uses a superset of the code points used for
993 Newer stored string -- Suppose that a querying application is using
994 oldVersion and the stored string was created using a profile that
999 order to get a match, the querying application has to enter the
1077 without a great deal of context such as knowing the fonts used,
1081 showing the user when a string changes between scripts.
1090 used in a profile of stringprep, it is easy to create many strings
1145 registered. The IESG MAY change a profile before registration.
1147 IANA has set up a registry of stringprep profiles. This registry is
1148 a single text file that lists the known profiles. Each entry in the
1157 Each version of a profile will remain listed in the registry forever.
1158 That is, if a new version of a profile supersedes an earlier version,
1163 It is probably harmful if a large number of profiles of stringprep
1239 A. Unicode repertoires
1245 A.1 Unassigned code points in Unicode 3.2
1247 ----- Start Table A.1 -----
1262 0487
1276 060D-061A
1296 093A-093B
1476 103A-103F
1477 105A-109F
1481 115A-115E
1532 181A-181F
1578 268A-2700
1580 270A-270B
1607 3377-337A
1708 ----- End Table A.1 -----
1721 one code point, with each code point separated by a space.
1775 004A; 006A; Case map
1799 005A; 007A; Case map
1837 010A; 010B; Case map
1845 011A; 011B; Case map
1861 012A; 012B; Case map
1868 0139; 013A; Case map
1877 014A; 014B; Case map
1885 015A; 015B; Case map
1893 016A; 016B; Case map
1901 0179; 017A; Case map
1919 018A; 0257; Case map
1942 01B1; 028A; Case map
1980 01F0; 006A 030C; Case map
1995 020A; 020B; Case map
2003 021A; 021B; Case map
2011 022A; 022B; Case map
2025 037A; 0020 03B9; Additional folding
2029 038A; 03AF; Case map
2043 039A; 03BA; Case map
2104 040A; 045A; Case map
2120 041A; 043A; Case map
2144 042A; 044A; Case map
2155 046A; 046B; Case map
2163 047A; 047B; Case map
2167 048A; 048B; Case map
2175 049A; 049B; Case map
2234 050A; 050B; Case map
2254 053A; 056A; Case map
2270 054A; 057A; Case map
2377 1E98; 0077 030A; Case map
2378 1E99; 0079 030A; Case map
2621 211A; 0071; Additional folding
2628 2124; 007A; Additional folding
2630 2128; 007A; Additional folding
2639 212A; 006B; Case map
2659 216A; 217A; Case map
2710 338A; 0070 0066; Additional folding
2713 3390; 0068 007A; Additional folding
2714 3391; 006B 0068 007A; Additional folding
2715 3392; 006D 0068 007A; Additional folding
2716 3393; 0067 0068 007A; Additional folding
2717 3394; 0074 0068 007A; Additional folding
2797 10402; 1042A; Case map
2813 1040A; 10432; Case map
2821 10412; 1043A; Case map
2829 1041A; 10442; Case map
2837 10422; 1044A; Case map
2850 1D409; 006A; Additional folding
2874 1D419; 007A; Additional folding
2884 1D43D; 006A; Additional folding
2900 1D44D; 007A; Additional folding
2910 1D471; 006A; Additional folding
2934 1D481; 007A; Additional folding
2939 1D4A5; 006A; Additional folding
2952 1D4B5; 007A; Additional folding
2962 1D4D9; 006A; Additional folding
2986 1D4E9; 007A; Additional folding
2993 1D50D; 006A; Additional folding
3015 1D541; 006A; Additional folding
3044 1D575; 006A; Additional folding
3060 1D585; 007A; Additional folding
3070 1D5A9; 006A; Additional folding
3094 1D5B9; 007A; Additional folding
3104 1D5DD; 006A; Additional folding
3120 1D5ED; 007A; Additional folding
3130 1D611; 006A; Additional folding
3154 1D621; 007A; Additional folding
3164 1D645; 006A; Additional folding
3180 1D655; 007A; Additional folding
3190 1D679; 006A; Additional folding
3214 1D689; 007A; Additional folding
3383 004A; 006A; Case map
3399 005A; 007A; Case map
3445 010A; 010B; Case map
3453 011A; 011B; Case map
3461 012A; 012B; Case map
3468 0139; 013A; Case map
3485 014A; 014B; Case map
3493 015A; 015B; Case map
3501 016A; 016B; Case map
3509 0179; 017A; Case map
3519 018A; 0257; Case map
3550 01B1; 028A; Case map
3580 01F0; 006A 030C; Case map
3603 020A; 020B; Case map
3611 021A; 021B; Case map
3619 022A; 022B; Case map
3628 038A; 03AF; Case map
3650 039A; 03BA; Case map
3708 040A; 045A; Case map
3724 041A; 043A; Case map
3740 042A; 044A; Case map
3759 046A; 046B; Case map
3767 047A; 047B; Case map
3771 048A; 048B; Case map
3779 049A; 049B; Case map
3838 050A; 050B; Case map
3850 053A; 056A; Case map
3874 054A; 057A; Case map
3973 1E98; 0077 030A; Case map
3974 1E99; 0079 030A; Case map
4212 212A; 006B; Case map
4224 216A; 217A; Case map
4304 10402; 1042A; Case map
4320 1040A; 10432; Case map
4328 10412; 1043A; Case map
4336 1041A; 10442; Case map
4344 10422; 1044A; Case map
4354 point, a semicolon, and a comment which is the name of the code
4389 200A; HAIR SPACE
4420 206A-206F; [CONTROL CHARACTERS]
4502 202A; LEFT-TO-RIGHT EMBEDDING
4507 206A; INHIBIT SYMMETRIC SWAPPING
4534 0621-063A
4543 0640-064A
4574 0041-005A
4575 0061-007A
4589 037A
4599 0388-038A
4605 048A-04CE
4796 1029-102A
4810 124A-124D
4813 125A-125D
4824 128A-128D
4839 1348-135A
4843 1681-169A
4895 210A-2113
4901 212A-212D
4907 2336-237A
4955 10330-1034A
5073 MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.