Bug 7607

Summary: Unicode 5.0.0 Support
Product: DejaVu
Reporter: pipian
Component: General
Assignee: Deja Vu bugs <dejavu-bugs>
Status: NEW
QA Contact:
Severity: enhancement
Priority: high
CC: markus.s, robin
Version: unspecified
Hardware: All   
OS: All   
URL: http://www.unicode.org/versions/Unicode5.0.0/
Whiteboard:

Description pipian 2006-07-23 14:34:26 UTC
Probably a long-term enhancement bug, but it might be useful as a meta-bug for Unicode 5.0.0 issues.

Given that Unicode 5.0.0 appears to be 'published' as of July 23, 2006 (i.e. the data files are finalized,
but the book is not due to be released until 4Q 2006, and the chapters will not be online until 1Q 2007),
it may be worthwhile to move any Unicode 5.0.0 characters out of the PUA and (at the very least) start
work on the new allocations.

I don't think there's much reason to hesitate on moving and adding characters either.  As the Unicode 
5.0.0 page (URL linked above) says:

"In order to provide an opportunity for developers to develop Unicode 5.0 as soon as possible, these 
data files have been released ahead of the publication of the text of the standard."

While character designs for the added characters are not visible in the downloadable code charts yet, 
proposal N2991 (http://std.dkuug.dk/jtc1/sc2/wg2/docs/N2991.pdf) appears to show all of the 
added characters, along with notes pointing to the original proposals for the added letters.

Of note are the following characters added to the standard as of 5.0.0 that are (presumably) highest 
priority (being from the Latin, Greek, and Cyrillic scripts, as well as various Symbols, Punctuation, and 
Diacritics):

U+0242: LATIN SMALL LETTER GLOTTAL STOP
U+0243: LATIN CAPITAL LETTER B WITH STROKE
U+0244: LATIN CAPITAL LETTER U BAR
U+0245: LATIN CAPITAL LETTER TURNED V
U+0246: LATIN CAPITAL LETTER E WITH STROKE
U+0247: LATIN SMALL LETTER E WITH STROKE
U+0248: LATIN CAPITAL LETTER J WITH STROKE
U+0249: LATIN SMALL LETTER J WITH STROKE
U+024A: LATIN CAPITAL LETTER SMALL Q WITH HOOK TAIL
U+024B: LATIN SMALL LETTER Q WITH HOOK TAIL
U+024C: LATIN CAPITAL LETTER R WITH STROKE
U+024D: LATIN SMALL LETTER R WITH STROKE
U+024E: LATIN CAPITAL LETTER Y WITH STROKE
U+024F: LATIN SMALL LETTER Y WITH STROKE
U+037B: GREEK SMALL REVERSED LUNATE SIGMA SYMBOL
U+037C: GREEK SMALL DOTTED LUNATE SIGMA SYMBOL
U+037D: GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOL
U+04CF: CYRILLIC SMALL LETTER PALOCHKA
U+04FA: CYRILLIC CAPITAL LETTER GHE WITH STROKE AND HOOK
U+04FB: CYRILLIC SMALL LETTER GHE WITH STROKE AND HOOK
U+04FC: CYRILLIC CAPITAL LETTER HA WITH HOOK
U+04FD: CYRILLIC SMALL LETTER HA WITH HOOK
U+04FE: CYRILLIC CAPITAL LETTER HA WITH STROKE
U+04FF: CYRILLIC SMALL LETTER HA WITH STROKE
U+0510: CYRILLIC CAPITAL LETTER REVERSED ZE
U+0511: CYRILLIC SMALL LETTER REVERSED ZE
U+0512: CYRILLIC CAPITAL LETTER EL WITH HOOK
U+0513: CYRILLIC SMALL LETTER EL WITH HOOK
U+1DC4: COMBINING MACRON-ACUTE
U+1DC5: COMBINING GRAVE-MACRON
U+1DC6: COMBINING MACRON-GRAVE
U+1DC7: COMBINING ACUTE-MACRON
U+1DC8: COMBINING GRAVE-ACUTE-GRAVE
U+1DC9: COMBINING ACUTE-GRAVE-ACUTE
U+1DCA: COMBINING LATIN SMALL LETTER R BELOW
U+1DFE: COMBINING LEFT ARROWHEAD ABOVE
U+1DFF: COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
U+20EC: COMBINING RIGHTWARDS HARPOON WITH BARB DOWNWARDS
U+20ED: COMBINING LEFTWARDS HARPOON WITH BARB DOWNWARDS
U+20EE: COMBINING LEFT ARROW BELOW
U+20EF: COMBINING RIGHT ARROW BELOW
U+214D: AKTIESELSKAB
U+214E: TURNED SMALL F
U+2184: LATIN SMALL LETTER REVERSED C
U+23DC: TOP PARENTHESIS (mathematical use)
U+23DD: BOTTOM PARENTHESIS (mathematical use)
U+23DE: TOP CURLY BRACKET (mathematical use)
U+23DF: BOTTOM CURLY BRACKET (mathematical use)
U+23E0: TOP TORTOISE SHELL BRACKET (mathematical use)
U+23E1: BOTTOM TORTOISE SHELL BRACKET (mathematical use)
U+23E2: WHITE TRAPEZIUM
U+23E3: BENZENE RING WITH CIRCLE
U+23E4: STRAIGHTNESS
U+23E5: FLATNESS
U+23E6: AC CURRENT
U+23E7: ELECTRICAL INTERSECTION
U+26B2: NEUTER
U+27C7: OR WITH DOT INSIDE
U+27C8: REVERSE SOLIDUS PRECEDING SUBSET
U+27C9: SUPERSET PRECEDING SOLIDUS
U+27CA: VERTICAL BAR WITH HORIZONTAL STROKE
U+2B14: SQUARE WITH UPPER RIGHT DIAGONAL HALF BLACK
U+2B15: SQUARE WITH LOWER LEFT DIAGONAL HALF BLACK
U+2B16: DIAMOND WITH LEFT HALF BLACK
U+2B17: DIAMOND WITH RIGHT HALF BLACK
U+2B18: DIAMOND WITH TOP HALF BLACK
U+2B19: DIAMOND WITH BOTTOM HALF BLACK
U+2B1A: DOTTED SQUARE
U+2B20: WHITE PENTAGON
U+2B21: WHITE HEXAGON
U+2B22: BLACK HEXAGON
U+2B23: HORIZONTAL BLACK HEXAGON
U+2C60: LATIN CAPITAL LETTER L WITH DOUBLE BAR
U+2C61: LATIN SMALL LETTER L WITH DOUBLE BAR
U+2C62: LATIN CAPITAL LETTER L WITH MIDDLE TILDE
U+2C63: LATIN CAPITAL LETTER P WITH STROKE
U+2C64: LATIN CAPITAL LETTER R WITH TAIL
U+2C65: LATIN SMALL LETTER A WITH STROKE
U+2C66: LATIN SMALL LETTER T WITH DIAGONAL STROKE
U+2C67: LATIN CAPITAL LETTER H WITH DESCENDER
U+2C68: LATIN SMALL LETTER H WITH DESCENDER
U+2C69: LATIN CAPITAL LETTER K WITH DESCENDER
U+2C6A: LATIN SMALL LETTER K WITH DESCENDER
U+2C6B: LATIN CAPITAL LETTER Z WITH DESCENDER
U+2C6C: LATIN SMALL LETTER Z WITH DESCENDER
U+2C74: LATIN SMALL LETTER V WITH CURL
U+2C75: LATIN CAPITAL LETTER HALF H
U+2C76: LATIN SMALL LETTER HALF H
U+2C77: LATIN SMALL LETTER TAILLESS PHI
U+A717: MODIFIER LETTER DOT VERTICAL BAR
U+A718: MODIFIER LETTER DOT SLASH
U+A719: MODIFIER LETTER DOT HORIZONTAL BAR
U+A71A: MODIFIER LETTER LOWER RIGHT CORNER ANGLE
U+A720: MODIFIER LETTER STRESS AND HIGH TONE
U+A721: MODIFIER LETTER STRESS AND LOW TONE
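
To track progress on a list like the one above, something along these lines might help. It is only a rough sketch, not part of any DejaVu tooling: the names `NEW_IN_5_0`, `missing_from` and `format_report` are made up for illustration, the code point list is a small sample rather than the full list, and the `covered` set stands in for whatever code points a given font build actually maps. Character names are looked up in Python's bundled `unicodedata` tables, which may be from a later Unicode version than 5.0.0.

```python
import unicodedata

# A small sample of the code points listed above (hypothetical selection,
# not the full list from this report).
NEW_IN_5_0 = [0x0242, 0x0243, 0x0249, 0x04CF, 0x04FD, 0x214D, 0x2C60, 0xA717]


def missing_from(covered, wanted=NEW_IN_5_0):
    """Return (code point, name) pairs from `wanted` that are not in `covered`."""
    covered = set(covered)
    return [
        (cp, unicodedata.name(chr(cp), "<unnamed>"))
        for cp in wanted
        if cp not in covered
    ]


def format_report(missing):
    """Format pairs in the same 'U+XXXX: NAME' style used in this report."""
    return [f"U+{cp:04X}: {name}" for cp, name in missing]


if __name__ == "__main__":
    # Assumption for the example: the font already covers U+0242 and U+0243.
    covered = {0x0242, 0x0243}
    for line in format_report(missing_from(covered)):
        print(line)
```

In practice the `covered` set would come from the font's character map, and the sample list would be replaced with the full set of 5.0.0 additions being tracked.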
