(Defining Faces): Add `customized-face'.

[gnu-emacs] / lispref / nonascii.texi
diff --git a/lispref/nonascii.texi b/lispref/nonascii.texi

index 62bd28fd78be2254dc56801a1f17c5351c4c1a59..9683156541de1ac8e35c128f4208baa6b9dc9792 100644 (file)
--- a/lispref/nonascii.texi
+++ b/lispref/nonascii.texi
@@ -95,9 +95,10 @@ default value to @code{nil} early in startup.
  
  @defun position-bytes position
  @tindex position-bytes
-Return the byte-position corresponding to buffer position @var{position}
-in the current buffer.  If @var{position} is out of range, the value
-is @code{nil}.
+Return the byte-position corresponding to buffer position
+@var{position} in the current buffer.  This is 1 at the start of the
+buffer, and counts upward in bytes.  If @var{position} is out of
+range, the value is @code{nil}.
  @end defun
  
  @defun byte-to-position byte-position
@@ -292,8 +293,8 @@ codes cannot occur at all in multibyte text.  Only the @acronym{ASCII} codes
  0 through 127 are completely legitimate in both representations.
  
  @defun char-valid-p charcode &optional genericp
-This returns @code{t} if @var{charcode} is valid for either one of the two
-text representations.
+This returns @code{t} if @var{charcode} is valid (either for unibyte
+text or for multibyte text).
  
  @example
  (char-valid-p 65)
@@ -359,6 +360,11 @@ as the property list of that symbol.  Charset properties are used for
  special purposes within Emacs.
  @end defun
  
+@deffn Command list-charset-chars charset
+This command displays a list of characters in the character set
+@var{charset}.
+@end deffn
+
  @node Chars and Bytes
  @section Characters and Bytes
  @cindex bytes and characters
@@ -474,6 +480,13 @@ part of a buffer or a string.  One use for this is in determining which
  coding systems (@pxref{Coding Systems}) are capable of representing all
  of the text in question.
  
+@defun charset-after &optional pos
+This function return the charset of a character in the current buffer
+at position @var{pos}.  If @var{pos} is omitted or @code{nil}, it
+defauls to the current value of point.  If @var{pos} is out of range,
+the value is @code{nil}.
+@end defun
+
  @defun find-charset-region beg end &optional translation
  This function returns a list of the character sets that appear in the
  current buffer between positions @var{beg} and @var{end}.
@@ -615,6 +628,27 @@ characters; for example, there are three coding systems for the Cyrillic
  conversion, but some of them leave the choice unspecified---to be chosen
  heuristically for each file, based on the data.
  
+  In general, a coding system doesn't guarantee roundtrip identity:
+decoding a byte sequence using coding system, then encoding the
+resulting text in the same coding system, can produce a different byte
+sequence.  However, the following coding systems do guarantee that the
+byte sequence will be the same as what you originally decoded:
+
+@quotation
+chinese-big5 chinese-iso-8bit cyrillic-iso-8bit emacs-mule
+greek-iso-8bit hebrew-iso-8bit iso-latin-1 iso-latin-2 iso-latin-3
+iso-latin-4 iso-latin-5 iso-latin-8 iso-latin-9 iso-safe
+japanese-iso-8bit japanese-shift-jis korean-iso-8bit raw-text
+@end quotation
+
+  Encoding buffer text and then decoding the result can also fail to
+reproduce the original text.  For instance, if you encode Latin-2
+characters with @code{utf-8} and decode the result using the same
+coding system, you'll get Unicode characters (of charset
+@code{mule-unicode-0100-24ff}).  If you encode Unicode characters with
+@code{iso-latin-2} and decode the result with the same coding system,
+you'll get Latin-2 characters.
+
  @cindex end of line conversion
    @dfn{End of line conversion} handles three different conventions used
  on various systems for representing end of line in files.  The Unix
@@ -673,7 +707,7 @@ a coding system for decoding the file data, and @code{write-region}
  uses one to encode the buffer contents.
  
    You can specify the coding system to use either explicitly
-(@pxref{Specifying Coding Systems}), or implicitly using the defaulting
+(@pxref{Specifying Coding Systems}), or implicitly using a default
  mechanism (@pxref{Default Coding Systems}).  But these methods may not
  completely specify what to do.  For example, they may choose a coding
  system such as @code{undefined} which leaves the character code
@@ -1033,11 +1067,11 @@ for decoding (in case @var{operation} does decoding), and
  @var{encoding-system} is the coding system for encoding (in case
  @var{operation} does encoding).
  
-The argument @var{operation} should be a symbol, one of
-@code{insert-file-contents}, @code{write-region}, @code{call-process},
-@code{call-process-region}, @code{start-process}, or
-@code{open-network-stream}.  These are the names of the Emacs I/O primitives
-that can do coding system conversion.
+The argument @var{operation} should be a symbol, any one of
+@code{insert-file-contents}, @code{write-region},
+@code{start-process}, @code{call-process}, @code{call-process-region},
+or @code{open-network-stream}.  These are the names of the Emacs I/O
+primitives that can do coding system conversion.
  
  The remaining arguments should be the same arguments that might be given
  to that I/O primitive.  Depending on the primitive, one of those
@@ -1047,9 +1081,9 @@ name is the target.  For subprocess primitives, the process name is the
  target.  For @code{open-network-stream}, the target is the service name
  or port number.
  
-This function looks up the target in @code{file-coding-system-alist},
-@code{process-coding-system-alist}, or
-@code{network-coding-system-alist}, depending on @var{operation}.
+Depending on @var{operation}, this function looks up the target in
+@code{file-coding-system-alist}, @code{process-coding-system-alist},
+or @code{network-coding-system-alist}.
  @end defun
  
  @node Specifying Coding Systems