                                                                   -*-text-*-

Problems, fixmes and other issues in the emacs-unicode branch

Notes by fx to record a few things.  handa needs to check them --
don't take too seriously, especially with regard to completeness.

Do take seriously that you don't want this CVS branch unless you're
actually working on it.  If you just want to edit Unicode and/or unify
iso-8859 et al, see the existing support and the extra stuff at
<URL:ftp://dlpx1.dl.ac.uk/fx/emacs/Mule>.  Editing support is mostly
orthogonal to the internal representation.

 * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters.

 * Grok UTF-8 surrogates.

 * Rationalize character syntax and its relationship to the Unicode
   database.  Specifically, the latin-N.el files aren't consistent for
   common characters.

 * Fontset handling and customization need work.

 * Likewise for charset and coding system priorities.

 * The relevant bits of latin1-disp.el need porting (and probably
   re-naming/updating).  See also cyril-util.el.

 * Quail files need work now the encoding is irrelevant.  E.g. make
   unified Latin pre- and post- methods.

 * What to do with the old coding categories stuff?

 * Something's wrong with categories, e.g. the result of C-u C-x e on
   Lao in HELLO.

 * Syntax for symbols &c in characters needs looking at.

 * The preferred-coding-system property of charsets should probably be
   junked unless it can be made more useful now.

 * find-coding-systems-for-charsets needs re-writing.

 * find-multibyte-characters needs looking at.

 * Implement Korean cp949/UHC and any other important missing
   charsets.

 * Check up on tcvn and alternativnyj.

 * Lazy-load tables for unify-charset somehow?

 * Should translation tables for {en,de}code and input work now or be
   scrapped?

 * Defining CCL coding systems currently doesn't work.

 * iso-2022 charsets get unified on i/o.

 * Revisit locale processing: look at treating the language and
   charset parts separately.  (Language should affect things like
   spelling and calendar, but that's not a Unicode issue.)

 * Handle Unicode combining characters usefully, e.g. diacritics, and
   handle more scripts specifically (à la Devanagari).  There are
   issues with canonicalization.

 * Bidi is a separate issue.

 * DTRT with X keysyms.  We should get the right unicode for a given
   keysym, not decode raw bytes in some ill-defined coding system.
   (fx has some data on keysyms v. unicodes.)

 * We need tabular input methods, e.g. for maths symbols.  (Not
   specific to Unicode.)

 * Need multibyte text in menus, e.g. for the above.  (Not specific to
   Unicode.)