Merge from origin/emacs-24

[gnu-emacs] / doc / lispref / searching.texi
diff --git a/doc/lispref/searching.texi b/doc/lispref/searching.texi

index 4a222a6e7af9c0e556a93809b70237d9f6764b19..87513e8f9ce252bc2effd5ef03694f7c4345eb72 100644 (file)
--- a/doc/lispref/searching.texi
+++ b/doc/lispref/searching.texi
@@ -1,6 +1,6 @@
  @c -*-texinfo-*-
  @c This is part of the GNU Emacs Lisp Reference Manual.
-@c Copyright (C) 1990-1995, 1998-1999, 2001-2013 Free Software
+@c Copyright (C) 1990-1995, 1998-1999, 2001-2015 Free Software
  @c Foundation, Inc.
  @c See the file elisp.texi for copying conditions.
  @node Searching and Matching
@@ -137,7 +137,7 @@ the ball boy!"
  
  @group
  (word-search-forward "Please find the ball, boy.")
-     @result{} 36
+     @result{} 39
  
  ---------- Buffer: foo ----------
  He said "Please!  Find
@@ -160,16 +160,17 @@ If @var{repeat} is non-@code{nil}, then the search is repeated that many
  times.  Point is positioned at the end of the last match.
  
  @findex word-search-regexp
-Internal, @code{word-search-forward} and related functions use the
+Internally, @code{word-search-forward} and related functions use the
  function @code{word-search-regexp} to convert @var{string} to a
  regular expression that ignores punctuation.
  @end deffn
  
  @deffn Command word-search-forward-lax string &optional limit noerror repeat
  This command is identical to @code{word-search-forward}, except that
-the end of @var{string} need not match a word boundary, unless @var{string} ends
-in whitespace.  For instance, searching for @samp{ball boy} matches
-@samp{ball boyee}, but does not match @samp{aball boy}.
+the beginning or the end of @var{string} need not match a word
+boundary, unless @var{string} begins or ends in whitespace.
+For instance, searching for @samp{ball boy} matches @samp{ball boyee},
+but does not match @samp{balls boy}.
  @end deffn
  
  @deffn Command word-search-backward string &optional limit noerror repeat
@@ -181,8 +182,8 @@ beginning of the match.
  
  @deffn Command word-search-backward-lax string &optional limit noerror repeat
  This command is identical to @code{word-search-backward}, except that
-the end of @var{string} need not match a word boundary, unless @var{string} ends
-in whitespace.
+the beginning or the end of @var{string} need not match a word
+boundary, unless @var{string} begins or ends in whitespace.
  @end deffn
  
  @node Searching and Case
@@ -256,6 +257,8 @@ it easier to verify even very complex regexps.
  
  @node Syntax of Regexps
  @subsection Syntax of Regular Expressions
+@cindex regexp syntax
+@cindex syntax of regular expressions
  
    Regular expressions have a syntax in which a few characters are
  special constructs and the rest are @dfn{ordinary}.  An ordinary
@@ -273,12 +276,12 @@ expression is ordinary, unless a @samp{\} precedes it.
  therefore @samp{f} is a regular expression that matches the string
  @samp{f} and no other string.  (It does @emph{not} match the string
  @samp{fg}, but it does match a @emph{part} of that string.)  Likewise,
-@samp{o} is a regular expression that matches only @samp{o}.@refill
+@samp{o} is a regular expression that matches only @samp{o}.
  
    Any two regular expressions @var{a} and @var{b} can be concatenated.  The
  result is a regular expression that matches a string if @var{a} matches
  some amount of the beginning of that string and @var{b} matches the rest of
-the string.@refill
+the string.
  
    As a simple example, we can concatenate the regular expressions @samp{f}
  and @samp{o} to get the regular expression @samp{fo}, which matches only
@@ -293,6 +296,7 @@ need to use one of the special regular expression constructs.
  
  @node Regexp Special
  @subsubsection Special Characters in Regular Expressions
+@cindex regexp, special characters in
  
    Here is a list of the characters that are special in a regular
  expression.
@@ -304,7 +308,7 @@ expression.
  is a special character that matches any single character except a newline.
  Using concatenation, we can make regular expressions like @samp{a.b}, which
  matches any three-character string that begins with @samp{a} and ends with
-@samp{b}.@refill
+@samp{b}.
  
  @item @samp{*}
  @cindex @samp{*} in regexp
@@ -488,7 +492,7 @@ example, the regular expression that matches the @samp{\} character is
  @samp{\\}.  To write a Lisp string that contains the characters
  @samp{\\}, Lisp syntax requires you to quote each @samp{\} with another
  @samp{\}.  Therefore, the read syntax for a regular expression matching
-@samp{\} is @code{"\\\\"}.@refill
+@samp{\} is @code{"\\\\"}.
  @end table
  
  @strong{Please note:} For historical compatibility, special characters
@@ -496,7 +500,7 @@ are treated as ordinary ones if they are in contexts where their special
  meanings make no sense.  For example, @samp{*foo} treats @samp{*} as
  ordinary since there is no preceding expression on which the @samp{*}
  can act.  It is poor practice to depend on this behavior; quote the
-special character anyway, regardless of where it appears.@refill
+special character anyway, regardless of where it appears.
  
  As a @samp{\} is not special inside a character alternative, it can
  never remove the special meaning of @samp{-} or @samp{]}.  So you
@@ -537,11 +541,15 @@ and what they mean:
  @item [:ascii:]
  This matches any @acronym{ASCII} character (codes 0--127).
  @item [:alnum:]
-This matches any letter or digit.  (At present, for multibyte
-characters, it matches anything that has word syntax.)
+This matches any letter or digit.  For multibyte characters, it
+matches characters whose Unicode @samp{general-category} property
+(@pxref{Character Properties}) indicates they are alphabetic or
+decimal number characters.
  @item [:alpha:]
-This matches any letter.  (At present, for multibyte characters, it
-matches anything that has word syntax.)
+This matches any letter.  For multibyte characters, it matches
+characters whose Unicode @samp{general-category} property
+(@pxref{Character Properties}) indicates they are alphabetic
+characters.
  @item [:blank:]
  This matches space and tab only.
  @item [:cntrl:]
@@ -599,14 +607,14 @@ a table of the special @samp{\} constructs.
  specifies an alternative.
  Two regular expressions @var{a} and @var{b} with @samp{\|} in
  between form an expression that matches anything that either @var{a} or
-@var{b} matches.@refill
+@var{b} matches.
  
  Thus, @samp{foo\|bar} matches either @samp{foo} or @samp{bar}
-but no other string.@refill
+but no other string.
  
  @samp{\|} applies to the largest possible surrounding expressions.  Only a
  surrounding @samp{\( @dots{} \)} grouping can limit the grouping power of
-@samp{\|}.@refill
+@samp{\|}.
  
  If you need full backtracking capability to handle multiple uses of
  @samp{\|}, use the POSIX regular expression functions (@pxref{POSIX
@@ -785,7 +793,7 @@ matches the empty string, but only at point.
  matches the empty string, but only at the beginning or
  end of a word.  Thus, @samp{\bfoo\b} matches any occurrence of
  @samp{foo} as a separate word.  @samp{\bballs?\b} matches
-@samp{ball} or @samp{balls} as a separate word.@refill
+@samp{ball} or @samp{balls} as a separate word.
  
  @samp{\b} matches at the beginning or end of the buffer (or string)
  regardless of what text appears next to it.
@@ -893,6 +901,7 @@ beyond the minimum needed to end a sentence.
  
    These functions operate on regular expressions.
  
+@cindex quote special characters in regexp
  @defun regexp-quote string
  This function returns a regular expression whose only exact match is
  @var{string}.  Using this regular expression in @code{looking-at} will
@@ -923,6 +932,7 @@ whitespace:
  @end example
  @end defun
  
+@cindex optimize regexp
  @defun regexp-opt strings &optional paren
  This function returns an efficient regular expression that will match
  any of the strings in the list @var{strings}.  This is useful when you
@@ -982,10 +992,11 @@ list of characters @var{chars}.
  @cindex searching for regexp
  
    In GNU Emacs, you can search for the next match for a regular
-expression either incrementally or not.  For incremental search
-commands, see @ref{Regexp Search, , Regular Expression Search, emacs,
-The GNU Emacs Manual}.  Here we describe only the search functions
-useful in programs.  The principal one is @code{re-search-forward}.
+expression (@pxref{Syntax of Regexps}) either incrementally or not.
+For incremental search commands, see @ref{Regexp Search, , Regular
+Expression Search, emacs, The GNU Emacs Manual}.  Here we describe
+only the search functions useful in programs.  The principal one is
+@code{re-search-forward}.
  
    These search functions convert the regular expression to multibyte if
  the buffer is multibyte; they convert the regular expression to unibyte
@@ -1283,7 +1294,7 @@ If you did the last search in a buffer, you should omit the
  the current buffer is the one in which you performed the last search.
  Then this function edits the buffer, replacing the matched text with
  @var{replacement}.  It leaves point at the end of the replacement
-text, and returns @code{t}.
+text.
  
  If you performed the last search on a string, pass the same string as
  @var{string}.  Then this function returns a new string, in which the
@@ -1770,10 +1781,9 @@ Do not take action for this question---in other words, ``no''.
  Answer this question ``no'', and give up on the entire series of
  questions, assuming that the answers will be ``no''.
  
-@c FIXME: Document ‘unread-comment-events’?  --xfq
  @item exit-prefix
  Like @code{exit}, but add the key that was pressed to
-@code{unread-comment-events}.
+@code{unread-command-events} (@pxref{Event Input Misc}).
  
  @item act-and-exit
  Answer this question ``yes'', and give up on the entire series of