X-Git-Url: https://code.delx.au/gnu-emacs/blobdiff_plain/8d3655be5a5c41b1f0a9985bcdb614693fce67e5..6d81466ac65eb84cb77eaa73e589628b5afcb963:/admin/notes/unicode

diff --git a/admin/notes/unicode b/admin/notes/unicode
index 53d568dd62..cff619fd8b 100644
--- a/admin/notes/unicode
+++ b/admin/notes/unicode
@@ -1,8 +1,41 @@
 -*-mode: text; coding: utf-8;-*-
 
-Copyright (C) 2002-2013 Free Software Foundation, Inc.
+Copyright (C) 2002-2014 Free Software Foundation, Inc.
 See the end of the file for license conditions.
 
+Importing a new Unicode Standard version into Emacs
+-------------------------------------------------------------
+
+Emacs uses the following files from the Unicode Character Database
+(a.k.a. "UCD):
+
+  . UnicodeData.txt
+  . BidiMirroring.txt
+  . IVD_Sequences.txt
+
+First, these files need to be copied into admin/unidata/, and then
+Emacs should be rebuilt for them to take effect. Rebuilding Emacs
+updates several derived files elsewhere in the Emacs source tree,
+mainly in lisp/international/.
+
+When Emacs is rebuilt for the first time after importing the new
+files, pay attention to any warning or error messages. In particular,
+admin/unidata/unidata-gen.el will complain if UnicodeData.txt defines
+new bidirectional attributes of characters, because unidata-gen.el,
+bidi.c and dispextern.h need to be updated in that case; failure to do
+so will cause aborts in redisplay.
+
+Next, review the changes in UnicodeData.txt vs the previous version
+used by Emacs. Any changes, be it introduction of new scripts or
+addition of codepoints to existing scripts, need corresponding changes
+in the data used for filling char-script-table, see characters.el
+around line 1300. Other databases and settings in characters.el, such
+as the data for char-width-table, might also need changes.
+
+Any new scripts added by UnicodeData.txt will also need updates to
+script-representative-chars defined in fontset.el. Other databases in
+fontset.el might also need to be updated as needed.
+
 Problems, fixmes and other unicode-related issues
 -------------------------------------------------------------
 
@@ -201,15 +234,11 @@ nontrivial changes to the build process.
     operating in some other language environment.
 
         etc/tutorials/TUTORIAL.ja
-        etc/tutorials/TUTORIAL.ko
         leim/quail/cyril-jis.el
         leim/quail/hanja-jis.el
-        leim/quail/hanja.el
-        leim/quail/hanja3.el
         leim/quail/japanese.el
         leim/quail/py-punct.el
         leim/quail/pypunct-b5.el
-        leim/quail/symbol-ksc.el
         lisp/international/ja-dic-cnv.el
         lisp/international/ja-dic-utl.el
         lisp/international/kinsoku.el
@@ -219,6 +248,8 @@ nontrivial changes to the build process.
         lisp/language/japanese.el
         lisp/term/x-win.el
 
+ * utf-8-emacs
+
     These files contain characters that cannot be encoded in UTF-8.
 
         leim/quail/tibetan.el
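
Reviewer note: the "review the changes in UnicodeData.txt vs the previous
version" step added by this patch could be partly automated. The following
is only a rough sketch in Python, not part of the Emacs build. The function
names (parse_unicode_data, summarize_changes) are hypothetical. It assumes
the standard UCD layout, where fields are separated by semicolons, field 0
is the code point in hex, field 1 is the name, and field 4 is the
bidirectional class. Ranges given as "<..., First>" / "<..., Last>" pairs
are not expanded, so only their endpoints are reported.

```python
import sys


def parse_unicode_data(text):
    """Map code point -> (name, bidi class) from UnicodeData.txt content.

    Assumes the UCD field order: 0 = code point, 1 = name, 4 = bidi class.
    First/Last range pairs are recorded as two ordinary entries.
    """
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        fields = line.split(";")
        entries[int(fields[0], 16)] = (fields[1], fields[4])
    return entries


def summarize_changes(old_text, new_text):
    """Return (added code points, bidi classes absent from the old file)."""
    old = parse_unicode_data(old_text)
    new = parse_unicode_data(new_text)
    added = sorted(cp for cp in new if cp not in old)
    old_bidi = {bidi for _, bidi in old.values()}
    new_bidi = {bidi for _, bidi in new.values()}
    return added, sorted(new_bidi - old_bidi)


if __name__ == "__main__":
    # Usage (paths are supplied by the caller):
    #   python script.py OLD_UnicodeData.txt NEW_UnicodeData.txt
    with open(sys.argv[1], encoding="utf-8") as f:
        old_text = f.read()
    with open(sys.argv[2], encoding="utf-8") as f:
        new_text = f.read()
    added, bidi = summarize_changes(old_text, new_text)
    print("new code points:", " ".join(f"U+{cp:04X}" for cp in added))
    print("new bidi classes:", " ".join(bidi))
```

A non-empty list of new bidi classes would correspond to the case where,
per the notes above, unidata-gen.el, bidi.c and dispextern.h need updating.
The list of added code points is only a starting point for reviewing
char-script-table and char-width-table by hand.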