X-Git-Url: https://code.delx.au/gnu-emacs/blobdiff_plain/35fb32880c99aa7cd41b835dc17b8639b544dc69..60dd06a08276422871cd3d491a44d10d4bdc690c:/doc/lispref/nonascii.texi diff --git a/doc/lispref/nonascii.texi b/doc/lispref/nonascii.texi index 8289a49833..00a1dffed6 100644 --- a/doc/lispref/nonascii.texi +++ b/doc/lispref/nonascii.texi @@ -37,7 +37,7 @@ how they are stored in strings and buffers. Emacs buffers and strings support a large repertoire of characters from many different scripts, allowing users to type and display text -in most any known written language. +in almost any known written language. @cindex character codepoint @cindex codespace @@ -95,7 +95,7 @@ strings except for manipulating encoded text or binary non-text data. The representation for a string is determined and recorded in the string when the string is constructed. -@defopt enable-multibyte-characters +@defvar enable-multibyte-characters This variable specifies the current buffer's text representation. If it is non-@code{nil}, the buffer contains multibyte text; otherwise, it contains unibyte encoded text or binary non-text data. @@ -105,7 +105,7 @@ You cannot set this variable directly; instead, use the function The @samp{--unibyte} command line option does its job by setting the default value to @code{nil} early in startup. -@end defopt +@end defvar @defun position-bytes position Buffer positions are measured in character units. This function @@ -375,6 +375,7 @@ This property corresponds to the Unicode @code{Name} property. The value is a string consisting of upper-case Latin letters A to Z, digits, spaces, and hyphen @samp{-} characters. +@cindex unicode general category @item general-category This property corresponds to the Unicode @code{General_Category} property. The value is a symbol whose name is a 2-letter abbreviation @@ -501,13 +502,18 @@ This function stores @var{value} as the value of the property @var{propname} for the character @var{char}. @end defun -@defvar char-script-table +@defvar unicode-category-table The value of this variable is a char-table (@pxref{Char-Tables}) that -specifies, for each character, a symbol whose name is the script to -which the character belongs, according to the Unicode Standard -classification of the Unicode code space into script-specific blocks. -This char-table has a single extra slot whose value is the list of all -script symbols. +specifies, for each character, its Unicode @code{General_Category} +property as a symbol. +@end defvar + +@defvar char-script-table +The value of this variable is a char-table that specifies, for each +character, a symbol whose name is the script to which the character +belongs, according to the Unicode Standard classification of the +Unicode code space into script-specific blocks. This char-table has a +single extra slot whose value is the list of all script symbols. @end defvar @defvar char-width-table