Make `initials' completion work for /hh -> /home/horn again (bug#5524).

[gnu-emacs] / doc / lispref / searching.texi
diff --git a/doc/lispref/searching.texi b/doc/lispref/searching.texi

index 6bca6c79f2722476b1c2326d2dfc543842808cd7..48780d0a348b7dc97937b36e83486bed15b80120 100644 (file)
--- a/doc/lispref/searching.texi
+++ b/doc/lispref/searching.texi
@@ -1,7 +1,7 @@
  @c -*-texinfo-*-
  @c This is part of the GNU Emacs Lisp Reference Manual.
  @c Copyright (C) 1990, 1991, 1992, 1993, 1994, 1995, 1998, 1999, 2001,
-@c   2002, 2003, 2004, 2005, 2006, 2007, 2008  Free Software Foundation, Inc.
+@c   2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010  Free Software Foundation, Inc.
  @c See the file elisp.texi for copying conditions.
  @setfilename ../../info/searching
  @node Searching and Matching, Syntax Tables, Non-ASCII Characters, Top
@@ -106,11 +106,9 @@ leaves point at the beginning of the match.
  @end deffn
  
  @deffn Command word-search-forward string &optional limit noerror repeat
-@c  @cindex word search   Redundant
  This function searches forward from point for a ``word'' match for
  @var{string}.  If it finds a match, it sets point to the end of the
  match found, and returns the new value of point.
-@c Emacs 19 feature
  
  Word matching regards @var{string} as a sequence of words, disregarding
  punctuation that separates them.  It searches the buffer for the same
@@ -155,6 +153,13 @@ If @var{repeat} is non-@code{nil}, then the search is repeated that many
  times.  Point is positioned at the end of the last match.
  @end deffn
  
+@deffn Command word-search-forward-lax string &optional limit noerror repeat
+This command is identical to @code{word-search-forward}, except that
+the end of @code{string} need not match a word boundary unless it ends
+in whitespace.  For instance, searching for @samp{ball boy} matches
+@samp{ball boyee}, but does not match @samp{aball boy}.
+@end deffn
+
  @deffn Command word-search-backward string &optional limit noerror repeat
  This function searches backward from point for a word match to
  @var{string}.  This function is just like @code{word-search-forward}
@@ -162,6 +167,12 @@ except that it searches backward and normally leaves point at the
  beginning of the match.
  @end deffn
  
+@deffn Command word-search-backward-lax string &optional limit noerror repeat
+This command is identical to @code{word-search-backward}, except that
+the end of @code{string} need not match a word boundary unless it ends
+in whitespace.
+@end deffn
+
  @node Searching and Case
  @section Searching and Case
  @cindex searching and case
@@ -176,15 +187,21 @@ regular expressions, too; thus, @samp{[aB]} would match @samp{a} or
  @code{case-fold-search} to @code{nil}.  Then all letters must match
  exactly, including case.  This is a buffer-local variable; altering the
  variable affects only the current buffer.  (@xref{Intro to
-Buffer-Local}.)  Alternatively, you may change the value of
-@code{default-case-fold-search}, which is the default value of
-@code{case-fold-search} for buffers that do not override it.
+Buffer-Local}.)  Alternatively, you may change the default value of
+@code{case-fold-search}.
  
    Note that the user-level incremental search feature handles case
-distinctions differently.  When given a lower case letter, it looks for
-a match of either case, but when given an upper case letter, it looks
-for an upper case letter only.  But this has nothing to do with the
-searching functions used in Lisp code.
+distinctions differently.  When the search string contains only lower
+case letters, the search ignores case, but when the search string
+contains one or more upper case letters, the search becomes
+case-sensitive.  But this has nothing to do with the searching
+functions used in Lisp code.
+
+@defopt case-fold-search
+This buffer-local variable determines whether searches should ignore
+case.  If the variable is @code{nil} they do not ignore case; otherwise
+they do ignore case.
+@end defopt
  
  @defopt case-replace
  This variable determines whether the higher level replacement
@@ -197,24 +214,12 @@ This variable is used by passing it as an argument to the function
  @code{replace-match}.  @xref{Replacing Match}.
  @end defopt
  
-@defopt case-fold-search
-This buffer-local variable determines whether searches should ignore
-case.  If the variable is @code{nil} they do not ignore case; otherwise
-they do ignore case.
-@end defopt
-
-@defvar default-case-fold-search
-The value of this variable is the default value for
-@code{case-fold-search} in buffers that do not override it.  This is the
-same as @code{(default-value 'case-fold-search)}.
-@end defvar
-
  @node Regular Expressions
  @section Regular Expressions
  @cindex regular expression
  @cindex regexp
  
-  A @dfn{regular expression} (@dfn{regexp}, for short) is a pattern that
+  A @dfn{regular expression}, or @dfn{regexp} for short, is a pattern that
  denotes a (possibly infinite) set of strings.  Searching for matches for
  a regexp is a very powerful operation.  This section explains how to write
  regexps; the following section says how to search for them.
@@ -334,6 +339,7 @@ preceding expression either once or not at all.  For example,
  @samp{ca?r} matches @samp{car} or @samp{cr}; nothing else.
  
  @item @samp{*?}, @samp{+?}, @samp{??}
+@cindex non-greedy repetition characters in regexp
  These are ``non-greedy'' variants of the operators @samp{*}, @samp{+}
  and @samp{?}.  Where those operators match the largest possible
  substring (consistent with matching the entire containing expression),
@@ -533,8 +539,9 @@ matches any digit, as well as @samp{+} and @samp{-}.
  This matches graphic characters---everything except @acronym{ASCII} control
  characters, space, and the delete character.
  @item [:lower:]
-This matches any lower-case letter, as determined by
-the current case table (@pxref{Case Tables}).
+This matches any lower-case letter, as determined by the current case
+table (@pxref{Case Tables}).  If @code{case-fold-search} is
+non-@code{nil}, this also matches any upper-case letter.
  @item [:multibyte:]
  This matches any multibyte character (@pxref{Text Representations}).
  @item [:nonascii:]
@@ -551,8 +558,9 @@ This matches any character that has whitespace syntax
  @item [:unibyte:]
  This matches any unibyte character (@pxref{Text Representations}).
  @item [:upper:]
-This matches any upper-case letter, as determined by
-the current case table (@pxref{Case Tables}).
+This matches any upper-case letter, as determined by the current case
+table (@pxref{Case Tables}).  If @code{case-fold-search} is
+non-@code{nil}, this also matches any lower-case letter.
  @item [:word:]
  This matches any character that has word syntax (@pxref{Syntax Class
  Table}).
@@ -641,14 +649,20 @@ occasionally there is a conflict, and that led to the introduction of
  shy groups.
  
  @item \(?: @dots{} \)
+@cindex shy groups
+@cindex non-capturing group
+@cindex unnumbered group
+@cindex @samp{(?:} in regexp
  is the @dfn{shy group} construct.  A shy group serves the first two
  purposes of an ordinary group (controlling the nesting of other
  operators), but it does not get a number, so you cannot refer back to
-its value with @samp{\@var{digit}}.
+its value with @samp{\@var{digit}}.  Shy groups are particularly
+useful for mechanically-constructed regular expressions, because they
+can be added automatically without altering the numbering of ordinary,
+non-shy groups.
  
-Shy groups are particularly useful for mechanically-constructed regular
-expressions because they can be added automatically without altering the
-numbering of any ordinary, non-shy groups.
+Shy groups are also called @dfn{non-capturing} or @dfn{unnumbered
+groups}.
  
  @item \(?@var{num}: @dots{} \)
  is the @dfn{explicitly numbered group} construct.  Normal groups get
@@ -689,7 +703,7 @@ If a particular grouping construct in the regular expression was never
  matched---for instance, if it appears inside of an alternative that
  wasn't used, or inside of a repetition that repeated zero times---then
  the corresponding @samp{\@var{digit}} construct never matches
-anything.  To use an artificial example,, @samp{\(foo\(b*\)\|lose\)\2}
+anything.  To use an artificial example, @samp{\(foo\(b*\)\|lose\)\2}
  cannot match @samp{lose}: the second alternative inside the larger
  group matches it, but then @samp{\2} is undefined and can't match
  anything.  But it can match @samp{foobb}, because the first
@@ -926,8 +940,8 @@ regular expression which is equivalent to the actual value
  
  @defun regexp-opt-depth regexp
  This function returns the total number of grouping constructs
-(parenthesized expressions) in @var{regexp}.  (This does not include
-shy groups.)
+(parenthesized expressions) in @var{regexp}.  This does not include
+shy groups (@pxref{Regexp Backslash}).
  @end defun
  
  @node Regexp Search
@@ -1060,6 +1074,11 @@ the match is available as @code{(match-end 0)}.  @xref{Match Data}.
  @end example
  @end defun
  
+@defun string-match-p regexp string &optional start
+This predicate function does what @code{string-match} does, but it
+avoids modifying the match data.
+@end defun
+
  @defun looking-at regexp
  This function determines whether the text in the current buffer directly
  following point matches the regular expression @var{regexp}.  ``Directly
@@ -1069,7 +1088,8 @@ result is @code{t} if so, @code{nil} otherwise.
  
  This function does not move point, but it updates the match data, which
  you can access using @code{match-beginning} and @code{match-end}.
-@xref{Match Data}.
+@xref{Match Data}.  If you need to test for a match without modifying
+the match data, use @code{looking-at-p}, described below.
  
  In this example, point is located directly before the @samp{T}.  If it
  were anywhere else, the result would be @code{nil}.
@@ -1087,7 +1107,7 @@ comes back" twice.
  @end example
  @end defun
  
-@defun looking-back regexp &optional limit
+@defun looking-back regexp &optional limit greedy
  This function returns @code{t} if @var{regexp} matches text before
  point, ending at point, and @code{nil} otherwise.
  
@@ -1098,6 +1118,12 @@ You can bound the time required by specifying @var{limit}, which says
  not to search before @var{limit}.  In this case, the match that is
  found must begin at or after @var{limit}.
  
+If @var{greedy} is non-@code{nil}, this function extends the match
+backwards as far as possible, stopping when a single additional
+previous character cannot be part of a match for regexp.  When the
+match is extended, its starting position is allowed to occur before
+@var{limit}.
+
  @example
  @group
  ---------- Buffer: foo ----------
@@ -1113,6 +1139,11 @@ comes back" twice.
  @end example
  @end defun
  
+@defun looking-at-p regexp
+This predicate function works like @code{looking-at}, but without
+updating the match data.
+@end defun
+
  @defvar search-spaces-regexp
  If this variable is non-@code{nil}, it should be a regular expression
  that says how to search for whitespace.  In that case, any group of
@@ -1142,20 +1173,21 @@ match, as required by POSIX.  This is much slower, so use these
  functions only when you really need the longest match.
  
    The POSIX search and match functions do not properly support the
-non-greedy repetition operators.  This is because POSIX backtracking
-conflicts with the semantics of non-greedy repetition.
+non-greedy repetition operators (@pxref{Regexp Special, non-greedy}).
+This is because POSIX backtracking conflicts with the semantics of
+non-greedy repetition.
  
-@defun posix-search-forward regexp &optional limit noerror repeat
+@deffn Command posix-search-forward regexp &optional limit noerror repeat
  This is like @code{re-search-forward} except that it performs the full
  backtracking specified by the POSIX standard for regular expression
  matching.
-@end defun
+@end deffn
  
-@defun posix-search-backward regexp &optional limit noerror repeat
+@deffn Command posix-search-backward regexp &optional limit noerror repeat
  This is like @code{re-search-backward} except that it performs the full
  backtracking specified by the POSIX standard for regular expression
  matching.
-@end defun
+@end deffn
  
  @defun posix-looking-at regexp
  This is like @code{looking-at} except that it performs the full
@@ -1395,7 +1427,7 @@ subexpression is at the 13th character (@samp{c}).
    (re-search-forward "The \\(cat \\)")
    (match-beginning 0)
    (match-beginning 1))
-    @result{} (9 9 13)
+    @result{} (17 9 13)
  @end group
  
  @group
@@ -1719,6 +1751,12 @@ use this answer.
  Display some help, then ask again.
  @end table
  
+@defvar multi-query-replace-map
+This variable holds a keymap that extends @code{query-replace-map} by
+providing additional keybindings that are useful in multi-buffer
+replacements.
+@end defvar
+
  @defvar replace-search-function
  This variable specifies a function that @code{perform-replace} calls
  to search for the next string to replace.  Its default value is
@@ -1743,12 +1781,12 @@ arguments: the first 3 arguments of @code{re-search-forward}
    This section describes some variables that hold regular expressions
  used for certain purposes in editing:
  
-@defvar page-delimiter
+@defopt page-delimiter
  This is the regular expression describing line-beginnings that separate
  pages.  The default value is @code{"^\014"} (i.e., @code{"^^L"} or
  @code{"^\C-l"}); this matches a line that starts with a formfeed
  character.
-@end defvar
+@end defopt
  
    The following two regular expressions should @emph{not} assume the
  match always starts at the beginning of a line; they should not use
@@ -1759,22 +1797,22 @@ they accept matches that start after the left margin.  In that case, a
  @samp{^} would be incorrect.  However, a @samp{^} is harmless in modes
  where a left margin is never used.
  
-@defvar paragraph-separate
+@defopt paragraph-separate
  This is the regular expression for recognizing the beginning of a line
  that separates paragraphs.  (If you change this, you may have to
  change @code{paragraph-start} also.)  The default value is
  @w{@code{"[@ \t\f]*$"}}, which matches a line that consists entirely of
  spaces, tabs, and form feeds (after its left margin).
-@end defvar
+@end defopt
  
-@defvar paragraph-start
+@defopt paragraph-start
  This is the regular expression for recognizing the beginning of a line
  that starts @emph{or} separates paragraphs.  The default value is
  @w{@code{"\f\\|[ \t]*$"}}, which matches a line containing only
  whitespace or starting with a form feed (after its left margin).
-@end defvar
+@end defopt
  
-@defvar sentence-end
+@defopt sentence-end
  If non-@code{nil}, the value should be a regular expression describing
  the end of a sentence, including the whitespace following the
  sentence.  (All paragraph boundaries also end sentences, regardless.)
@@ -1783,7 +1821,7 @@ If the value is @code{nil}, the default, then the function
  @code{sentence-end} has to construct the regexp.  That is why you
  should always call the function @code{sentence-end} to obtain the
  regexp to be used to recognize the end of a sentence.
-@end defvar
+@end defopt
  
  @defun sentence-end
  This function returns the value of the variable @code{sentence-end},