]> code.delx.au - gnu-emacs/blob - doc/lispref/text.texi
Convert consecutive FSF copyright years to ranges.
[gnu-emacs] / doc / lispref / text.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-2011 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @setfilename ../../info/text
6 @node Text, Non-ASCII Characters, Markers, Top
7 @chapter Text
8 @cindex text
9
10 This chapter describes the functions that deal with the text in a
11 buffer. Most examine, insert, or delete text in the current buffer,
12 often operating at point or on text adjacent to point. Many are
13 interactive. All the functions that change the text provide for undoing
14 the changes (@pxref{Undo}).
15
16 Many text-related functions operate on a region of text defined by two
17 buffer positions passed in arguments named @var{start} and @var{end}.
18 These arguments should be either markers (@pxref{Markers}) or numeric
19 character positions (@pxref{Positions}). The order of these arguments
20 does not matter; it is all right for @var{start} to be the end of the
21 region and @var{end} the beginning. For example, @code{(delete-region 1
22 10)} and @code{(delete-region 10 1)} are equivalent. An
23 @code{args-out-of-range} error is signaled if either @var{start} or
24 @var{end} is outside the accessible portion of the buffer. In an
25 interactive call, point and the mark are used for these arguments.
26
27 @cindex buffer contents
28 Throughout this chapter, ``text'' refers to the characters in the
29 buffer, together with their properties (when relevant). Keep in mind
30 that point is always between two characters, and the cursor appears on
31 the character after point.
32
33 @menu
34 * Near Point:: Examining text in the vicinity of point.
35 * Buffer Contents:: Examining text in a general fashion.
36 * Comparing Text:: Comparing substrings of buffers.
37 * Insertion:: Adding new text to a buffer.
38 * Commands for Insertion:: User-level commands to insert text.
39 * Deletion:: Removing text from a buffer.
40 * User-Level Deletion:: User-level commands to delete text.
41 * The Kill Ring:: Where removed text sometimes is saved for later use.
42 * Undo:: Undoing changes to the text of a buffer.
43 * Maintaining Undo:: How to enable and disable undo information.
44 How to control how much information is kept.
45 * Filling:: Functions for explicit filling.
46 * Margins:: How to specify margins for filling commands.
47 * Adaptive Fill:: Adaptive Fill mode chooses a fill prefix from context.
48 * Auto Filling:: How auto-fill mode is implemented to break lines.
49 * Sorting:: Functions for sorting parts of the buffer.
50 * Columns:: Computing horizontal positions, and using them.
51 * Indentation:: Functions to insert or adjust indentation.
52 * Case Changes:: Case conversion of parts of the buffer.
53 * Text Properties:: Assigning Lisp property lists to text characters.
54 * Substitution:: Replacing a given character wherever it appears.
55 * Transposition:: Swapping two portions of a buffer.
56 * Registers:: How registers are implemented. Accessing the text or
57 position stored in a register.
58 * Base 64:: Conversion to or from base 64 encoding.
59 * MD5 Checksum:: Compute the MD5 "message digest"/"checksum".
60 * Parsing HTML:: Parsing HTML and XML.
61 * Atomic Changes:: Installing several buffer changes "atomically".
62 * Change Hooks:: Supplying functions to be run when text is changed.
63 @end menu
64
65 @node Near Point
66 @section Examining Text Near Point
67 @cindex text near point
68
69 Many functions are provided to look at the characters around point.
70 Several simple functions are described here. See also @code{looking-at}
71 in @ref{Regexp Search}.
72
73 In the following four functions, ``beginning'' or ``end'' of buffer
74 refers to the beginning or end of the accessible portion.
75
76 @defun char-after &optional position
77 This function returns the character in the current buffer at (i.e.,
78 immediately after) position @var{position}. If @var{position} is out of
79 range for this purpose, either before the beginning of the buffer, or at
80 or beyond the end, then the value is @code{nil}. The default for
81 @var{position} is point.
82
83 In the following example, assume that the first character in the
84 buffer is @samp{@@}:
85
86 @example
87 @group
88 (string (char-after 1))
89 @result{} "@@"
90 @end group
91 @end example
92 @end defun
93
94 @defun char-before &optional position
95 This function returns the character in the current buffer immediately
96 before position @var{position}. If @var{position} is out of range for
97 this purpose, either at or before the beginning of the buffer, or beyond
98 the end, then the value is @code{nil}. The default for
99 @var{position} is point.
100 @end defun
101
102 @defun following-char
103 This function returns the character following point in the current
104 buffer. This is similar to @code{(char-after (point))}. However, if
105 point is at the end of the buffer, then @code{following-char} returns 0.
106
107 Remember that point is always between characters, and the cursor
108 normally appears over the character following point. Therefore, the
109 character returned by @code{following-char} is the character the
110 cursor is over.
111
112 In this example, point is between the @samp{a} and the @samp{c}.
113
114 @example
115 @group
116 ---------- Buffer: foo ----------
117 Gentlemen may cry ``Pea@point{}ce! Peace!,''
118 but there is no peace.
119 ---------- Buffer: foo ----------
120 @end group
121
122 @group
123 (string (preceding-char))
124 @result{} "a"
125 (string (following-char))
126 @result{} "c"
127 @end group
128 @end example
129 @end defun
130
131 @defun preceding-char
132 This function returns the character preceding point in the current
133 buffer. See above, under @code{following-char}, for an example. If
134 point is at the beginning of the buffer, @code{preceding-char} returns
135 0.
136 @end defun
137
138 @defun bobp
139 This function returns @code{t} if point is at the beginning of the
140 buffer. If narrowing is in effect, this means the beginning of the
141 accessible portion of the text. See also @code{point-min} in
142 @ref{Point}.
143 @end defun
144
145 @defun eobp
146 This function returns @code{t} if point is at the end of the buffer.
147 If narrowing is in effect, this means the end of accessible portion of
148 the text. See also @code{point-max} in @xref{Point}.
149 @end defun
150
151 @defun bolp
152 This function returns @code{t} if point is at the beginning of a line.
153 @xref{Text Lines}. The beginning of the buffer (or of its accessible
154 portion) always counts as the beginning of a line.
155 @end defun
156
157 @defun eolp
158 This function returns @code{t} if point is at the end of a line. The
159 end of the buffer (or of its accessible portion) is always considered
160 the end of a line.
161 @end defun
162
163 @node Buffer Contents
164 @section Examining Buffer Contents
165
166 This section describes functions that allow a Lisp program to
167 convert any portion of the text in the buffer into a string.
168
169 @defun buffer-substring start end
170 This function returns a string containing a copy of the text of the
171 region defined by positions @var{start} and @var{end} in the current
172 buffer. If the arguments are not positions in the accessible portion of
173 the buffer, @code{buffer-substring} signals an @code{args-out-of-range}
174 error.
175
176 It is not necessary for @var{start} to be less than @var{end}; the
177 arguments can be given in either order. But most often the smaller
178 argument is written first.
179
180 Here's an example which assumes Font-Lock mode is not enabled:
181
182 @example
183 @group
184 ---------- Buffer: foo ----------
185 This is the contents of buffer foo
186
187 ---------- Buffer: foo ----------
188 @end group
189
190 @group
191 (buffer-substring 1 10)
192 @result{} "This is t"
193 @end group
194 @group
195 (buffer-substring (point-max) 10)
196 @result{} "he contents of buffer foo\n"
197 @end group
198 @end example
199
200 If the text being copied has any text properties, these are copied into
201 the string along with the characters they belong to. @xref{Text
202 Properties}. However, overlays (@pxref{Overlays}) in the buffer and
203 their properties are ignored, not copied.
204
205 For example, if Font-Lock mode is enabled, you might get results like
206 these:
207
208 @example
209 @group
210 (buffer-substring 1 10)
211 @result{} #("This is t" 0 1 (fontified t) 1 9 (fontified t))
212 @end group
213 @end example
214 @end defun
215
216 @defun buffer-substring-no-properties start end
217 This is like @code{buffer-substring}, except that it does not copy text
218 properties, just the characters themselves. @xref{Text Properties}.
219 @end defun
220
221 @defun filter-buffer-substring start end &optional delete noprops
222 This function passes the buffer text between @var{start} and @var{end}
223 through the filter functions specified by the variable
224 @code{buffer-substring-filters}, and returns the value from the last
225 filter function. If @code{buffer-substring-filters} is @code{nil},
226 the value is the unaltered text from the buffer, what
227 @code{buffer-substring} would return.
228
229 If @var{delete} is non-@code{nil}, this function deletes the text
230 between @var{start} and @var{end} after copying it, like
231 @code{delete-and-extract-region}.
232
233 If @var{noprops} is non-@code{nil}, the final string returned does not
234 include text properties, while the string passed through the filters
235 still includes text properties from the buffer text.
236
237 Lisp code should use this function instead of @code{buffer-substring},
238 @code{buffer-substring-no-properties},
239 or @code{delete-and-extract-region} when copying into user-accessible
240 data structures such as the kill-ring, X clipboard, and registers.
241 Major and minor modes can add functions to
242 @code{buffer-substring-filters} to alter such text as it is copied out
243 of the buffer.
244 @end defun
245
246 @defvar buffer-substring-filters
247 This variable should be a list of functions that accept a single
248 argument, a string, and return a string.
249 @code{filter-buffer-substring} passes the buffer substring to the
250 first function in this list, and the return value of each function is
251 passed to the next function. The return value of the last function is
252 used as the return value of @code{filter-buffer-substring}.
253
254 As a special convention, point is set to the start of the buffer text
255 being operated on (i.e., the @var{start} argument for
256 @code{filter-buffer-substring}) before these functions are called.
257
258 If this variable is @code{nil}, no filtering is performed.
259 @end defvar
260
261 @defun buffer-string
262 This function returns the contents of the entire accessible portion of
263 the current buffer as a string. It is equivalent to
264
265 @example
266 (buffer-substring (point-min) (point-max))
267 @end example
268
269 @example
270 @group
271 ---------- Buffer: foo ----------
272 This is the contents of buffer foo
273
274 ---------- Buffer: foo ----------
275
276 (buffer-string)
277 @result{} "This is the contents of buffer foo\n"
278 @end group
279 @end example
280 @end defun
281
282 @defun current-word &optional strict really-word
283 This function returns the symbol (or word) at or near point, as a string.
284 The return value includes no text properties.
285
286 If the optional argument @var{really-word} is non-@code{nil}, it finds a
287 word; otherwise, it finds a symbol (which includes both word
288 characters and symbol constituent characters).
289
290 If the optional argument @var{strict} is non-@code{nil}, then point
291 must be in or next to the symbol or word---if no symbol or word is
292 there, the function returns @code{nil}. Otherwise, a nearby symbol or
293 word on the same line is acceptable.
294 @end defun
295
296 @defun thing-at-point thing
297 Return the @var{thing} around or next to point, as a string.
298
299 The argument @var{thing} is a symbol which specifies a kind of syntactic
300 entity. Possibilities include @code{symbol}, @code{list}, @code{sexp},
301 @code{defun}, @code{filename}, @code{url}, @code{word}, @code{sentence},
302 @code{whitespace}, @code{line}, @code{page}, and others.
303
304 @example
305 ---------- Buffer: foo ----------
306 Gentlemen may cry ``Pea@point{}ce! Peace!,''
307 but there is no peace.
308 ---------- Buffer: foo ----------
309
310 (thing-at-point 'word)
311 @result{} "Peace"
312 (thing-at-point 'line)
313 @result{} "Gentlemen may cry ``Peace! Peace!,''\n"
314 (thing-at-point 'whitespace)
315 @result{} nil
316 @end example
317 @end defun
318
319 @node Comparing Text
320 @section Comparing Text
321 @cindex comparing buffer text
322
323 This function lets you compare portions of the text in a buffer, without
324 copying them into strings first.
325
326 @defun compare-buffer-substrings buffer1 start1 end1 buffer2 start2 end2
327 This function lets you compare two substrings of the same buffer or two
328 different buffers. The first three arguments specify one substring,
329 giving a buffer (or a buffer name) and two positions within the
330 buffer. The last three arguments specify the other substring in the
331 same way. You can use @code{nil} for @var{buffer1}, @var{buffer2}, or
332 both to stand for the current buffer.
333
334 The value is negative if the first substring is less, positive if the
335 first is greater, and zero if they are equal. The absolute value of
336 the result is one plus the index of the first differing characters
337 within the substrings.
338
339 This function ignores case when comparing characters
340 if @code{case-fold-search} is non-@code{nil}. It always ignores
341 text properties.
342
343 Suppose the current buffer contains the text @samp{foobarbar
344 haha!rara!}; then in this example the two substrings are @samp{rbar }
345 and @samp{rara!}. The value is 2 because the first substring is greater
346 at the second character.
347
348 @example
349 (compare-buffer-substrings nil 6 11 nil 16 21)
350 @result{} 2
351 @end example
352 @end defun
353
354 @node Insertion
355 @section Inserting Text
356 @cindex insertion of text
357 @cindex text insertion
358
359 @cindex insertion before point
360 @cindex before point, insertion
361 @dfn{Insertion} means adding new text to a buffer. The inserted text
362 goes at point---between the character before point and the character
363 after point. Some insertion functions leave point before the inserted
364 text, while other functions leave it after. We call the former
365 insertion @dfn{after point} and the latter insertion @dfn{before point}.
366
367 Insertion relocates markers that point at positions after the
368 insertion point, so that they stay with the surrounding text
369 (@pxref{Markers}). When a marker points at the place of insertion,
370 insertion may or may not relocate the marker, depending on the marker's
371 insertion type (@pxref{Marker Insertion Types}). Certain special
372 functions such as @code{insert-before-markers} relocate all such markers
373 to point after the inserted text, regardless of the markers' insertion
374 type.
375
376 Insertion functions signal an error if the current buffer is
377 read-only or if they insert within read-only text.
378
379 These functions copy text characters from strings and buffers along
380 with their properties. The inserted characters have exactly the same
381 properties as the characters they were copied from. By contrast,
382 characters specified as separate arguments, not part of a string or
383 buffer, inherit their text properties from the neighboring text.
384
385 The insertion functions convert text from unibyte to multibyte in
386 order to insert in a multibyte buffer, and vice versa---if the text
387 comes from a string or from a buffer. However, they do not convert
388 unibyte character codes 128 through 255 to multibyte characters, not
389 even if the current buffer is a multibyte buffer. @xref{Converting
390 Representations}.
391
392 @defun insert &rest args
393 This function inserts the strings and/or characters @var{args} into the
394 current buffer, at point, moving point forward. In other words, it
395 inserts the text before point. An error is signaled unless all
396 @var{args} are either strings or characters. The value is @code{nil}.
397 @end defun
398
399 @defun insert-before-markers &rest args
400 This function inserts the strings and/or characters @var{args} into the
401 current buffer, at point, moving point forward. An error is signaled
402 unless all @var{args} are either strings or characters. The value is
403 @code{nil}.
404
405 This function is unlike the other insertion functions in that it
406 relocates markers initially pointing at the insertion point, to point
407 after the inserted text. If an overlay begins at the insertion point,
408 the inserted text falls outside the overlay; if a nonempty overlay
409 ends at the insertion point, the inserted text falls inside that
410 overlay.
411 @end defun
412
413 @defun insert-char character count &optional inherit
414 This function inserts @var{count} instances of @var{character} into the
415 current buffer before point. The argument @var{count} should be an
416 integer, and @var{character} must be a character. The value is @code{nil}.
417
418 This function does not convert unibyte character codes 128 through 255
419 to multibyte characters, not even if the current buffer is a multibyte
420 buffer. @xref{Converting Representations}.
421
422 If @var{inherit} is non-@code{nil}, then the inserted characters inherit
423 sticky text properties from the two characters before and after the
424 insertion point. @xref{Sticky Properties}.
425 @end defun
426
427 @defun insert-buffer-substring from-buffer-or-name &optional start end
428 This function inserts a portion of buffer @var{from-buffer-or-name}
429 (which must already exist) into the current buffer before point. The
430 text inserted is the region between @var{start} and @var{end}. (These
431 arguments default to the beginning and end of the accessible portion of
432 that buffer.) This function returns @code{nil}.
433
434 In this example, the form is executed with buffer @samp{bar} as the
435 current buffer. We assume that buffer @samp{bar} is initially empty.
436
437 @example
438 @group
439 ---------- Buffer: foo ----------
440 We hold these truths to be self-evident, that all
441 ---------- Buffer: foo ----------
442 @end group
443
444 @group
445 (insert-buffer-substring "foo" 1 20)
446 @result{} nil
447
448 ---------- Buffer: bar ----------
449 We hold these truth@point{}
450 ---------- Buffer: bar ----------
451 @end group
452 @end example
453 @end defun
454
455 @defun insert-buffer-substring-no-properties from-buffer-or-name &optional start end
456 This is like @code{insert-buffer-substring} except that it does not
457 copy any text properties.
458 @end defun
459
460 @xref{Sticky Properties}, for other insertion functions that inherit
461 text properties from the nearby text in addition to inserting it.
462 Whitespace inserted by indentation functions also inherits text
463 properties.
464
465 @node Commands for Insertion
466 @section User-Level Insertion Commands
467
468 This section describes higher-level commands for inserting text,
469 commands intended primarily for the user but useful also in Lisp
470 programs.
471
472 @deffn Command insert-buffer from-buffer-or-name
473 This command inserts the entire accessible contents of
474 @var{from-buffer-or-name} (which must exist) into the current buffer
475 after point. It leaves the mark after the inserted text. The value
476 is @code{nil}.
477 @end deffn
478
479 @deffn Command self-insert-command count
480 @cindex character insertion
481 @cindex self-insertion
482 This command inserts the last character typed; it does so @var{count}
483 times, before point, and returns @code{nil}. Most printing characters
484 are bound to this command. In routine use, @code{self-insert-command}
485 is the most frequently called function in Emacs, but programs rarely use
486 it except to install it on a keymap.
487
488 In an interactive call, @var{count} is the numeric prefix argument.
489
490 Self-insertion translates the input character through
491 @code{translation-table-for-input}. @xref{Translation of Characters}.
492
493 This command calls @code{auto-fill-function} whenever that is
494 non-@code{nil} and the character inserted is in the table
495 @code{auto-fill-chars} (@pxref{Auto Filling}).
496
497 @c Cross refs reworded to prevent overfull hbox. --rjc 15mar92
498 This command performs abbrev expansion if Abbrev mode is enabled and
499 the inserted character does not have word-constituent
500 syntax. (@xref{Abbrevs}, and @ref{Syntax Class Table}.) It is also
501 responsible for calling @code{blink-paren-function} when the inserted
502 character has close parenthesis syntax (@pxref{Blinking}).
503
504 Do not try substituting your own definition of
505 @code{self-insert-command} for the standard one. The editor command
506 loop handles this function specially.
507 @end deffn
508
509 @deffn Command newline &optional number-of-newlines
510 This command inserts newlines into the current buffer before point.
511 If @var{number-of-newlines} is supplied, that many newline characters
512 are inserted.
513
514 @cindex newline and Auto Fill mode
515 This function calls @code{auto-fill-function} if the current column
516 number is greater than the value of @code{fill-column} and
517 @var{number-of-newlines} is @code{nil}. Typically what
518 @code{auto-fill-function} does is insert a newline; thus, the overall
519 result in this case is to insert two newlines at different places: one
520 at point, and another earlier in the line. @code{newline} does not
521 auto-fill if @var{number-of-newlines} is non-@code{nil}.
522
523 This command indents to the left margin if that is not zero.
524 @xref{Margins}.
525
526 The value returned is @code{nil}. In an interactive call, @var{count}
527 is the numeric prefix argument.
528 @end deffn
529
530 @defvar overwrite-mode
531 This variable controls whether overwrite mode is in effect. The value
532 should be @code{overwrite-mode-textual}, @code{overwrite-mode-binary},
533 or @code{nil}. @code{overwrite-mode-textual} specifies textual
534 overwrite mode (treats newlines and tabs specially), and
535 @code{overwrite-mode-binary} specifies binary overwrite mode (treats
536 newlines and tabs like any other characters).
537 @end defvar
538
539 @node Deletion
540 @section Deleting Text
541 @cindex text deletion
542
543 @cindex deleting text vs killing
544 Deletion means removing part of the text in a buffer, without saving
545 it in the kill ring (@pxref{The Kill Ring}). Deleted text can't be
546 yanked, but can be reinserted using the undo mechanism (@pxref{Undo}).
547 Some deletion functions do save text in the kill ring in some special
548 cases.
549
550 All of the deletion functions operate on the current buffer.
551
552 @deffn Command erase-buffer
553 This function deletes the entire text of the current buffer
554 (@emph{not} just the accessible portion), leaving it
555 empty. If the buffer is read-only, it signals a @code{buffer-read-only}
556 error; if some of the text in it is read-only, it signals a
557 @code{text-read-only} error. Otherwise, it deletes the text without
558 asking for any confirmation. It returns @code{nil}.
559
560 Normally, deleting a large amount of text from a buffer inhibits further
561 auto-saving of that buffer ``because it has shrunk.'' However,
562 @code{erase-buffer} does not do this, the idea being that the future
563 text is not really related to the former text, and its size should not
564 be compared with that of the former text.
565 @end deffn
566
567 @deffn Command delete-region start end
568 This command deletes the text between positions @var{start} and
569 @var{end} in the current buffer, and returns @code{nil}. If point was
570 inside the deleted region, its value afterward is @var{start}.
571 Otherwise, point relocates with the surrounding text, as markers do.
572 @end deffn
573
574 @defun delete-and-extract-region start end
575 This function deletes the text between positions @var{start} and
576 @var{end} in the current buffer, and returns a string containing the
577 text just deleted.
578
579 If point was inside the deleted region, its value afterward is
580 @var{start}. Otherwise, point relocates with the surrounding text, as
581 markers do.
582 @end defun
583
584 @deffn Command delete-char count &optional killp
585 This command deletes @var{count} characters directly after point, or
586 before point if @var{count} is negative. If @var{killp} is
587 non-@code{nil}, then it saves the deleted characters in the kill ring.
588
589 In an interactive call, @var{count} is the numeric prefix argument, and
590 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
591 argument is supplied, the text is saved in the kill ring. If no prefix
592 argument is supplied, then one character is deleted, but not saved in
593 the kill ring.
594
595 The value returned is always @code{nil}.
596 @end deffn
597
598 @deffn Command delete-backward-char count &optional killp
599 @cindex deleting previous char
600 This command deletes @var{count} characters directly before point, or
601 after point if @var{count} is negative. If @var{killp} is
602 non-@code{nil}, then it saves the deleted characters in the kill ring.
603
604 In an interactive call, @var{count} is the numeric prefix argument, and
605 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
606 argument is supplied, the text is saved in the kill ring. If no prefix
607 argument is supplied, then one character is deleted, but not saved in
608 the kill ring.
609
610 The value returned is always @code{nil}.
611 @end deffn
612
613 @deffn Command backward-delete-char-untabify count &optional killp
614 @cindex tab deletion
615 This command deletes @var{count} characters backward, changing tabs
616 into spaces. When the next character to be deleted is a tab, it is
617 first replaced with the proper number of spaces to preserve alignment
618 and then one of those spaces is deleted instead of the tab. If
619 @var{killp} is non-@code{nil}, then the command saves the deleted
620 characters in the kill ring.
621
622 Conversion of tabs to spaces happens only if @var{count} is positive.
623 If it is negative, exactly @minus{}@var{count} characters after point
624 are deleted.
625
626 In an interactive call, @var{count} is the numeric prefix argument, and
627 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
628 argument is supplied, the text is saved in the kill ring. If no prefix
629 argument is supplied, then one character is deleted, but not saved in
630 the kill ring.
631
632 The value returned is always @code{nil}.
633 @end deffn
634
635 @defopt backward-delete-char-untabify-method
636 This option specifies how @code{backward-delete-char-untabify} should
637 deal with whitespace. Possible values include @code{untabify}, the
638 default, meaning convert a tab to many spaces and delete one;
639 @code{hungry}, meaning delete all tabs and spaces before point with
640 one command; @code{all} meaning delete all tabs, spaces and newlines
641 before point, and @code{nil}, meaning do nothing special for
642 whitespace characters.
643 @end defopt
644
645 @node User-Level Deletion
646 @section User-Level Deletion Commands
647
648 This section describes higher-level commands for deleting text,
649 commands intended primarily for the user but useful also in Lisp
650 programs.
651
652 @deffn Command delete-horizontal-space &optional backward-only
653 @cindex deleting whitespace
654 This function deletes all spaces and tabs around point. It returns
655 @code{nil}.
656
657 If @var{backward-only} is non-@code{nil}, the function deletes
658 spaces and tabs before point, but not after point.
659
660 In the following examples, we call @code{delete-horizontal-space} four
661 times, once on each line, with point between the second and third
662 characters on the line each time.
663
664 @example
665 @group
666 ---------- Buffer: foo ----------
667 I @point{}thought
668 I @point{} thought
669 We@point{} thought
670 Yo@point{}u thought
671 ---------- Buffer: foo ----------
672 @end group
673
674 @group
675 (delete-horizontal-space) ; @r{Four times.}
676 @result{} nil
677
678 ---------- Buffer: foo ----------
679 Ithought
680 Ithought
681 Wethought
682 You thought
683 ---------- Buffer: foo ----------
684 @end group
685 @end example
686 @end deffn
687
688 @deffn Command delete-indentation &optional join-following-p
689 This function joins the line point is on to the previous line, deleting
690 any whitespace at the join and in some cases replacing it with one
691 space. If @var{join-following-p} is non-@code{nil},
692 @code{delete-indentation} joins this line to the following line
693 instead. The function returns @code{nil}.
694
695 If there is a fill prefix, and the second of the lines being joined
696 starts with the prefix, then @code{delete-indentation} deletes the
697 fill prefix before joining the lines. @xref{Margins}.
698
699 In the example below, point is located on the line starting
700 @samp{events}, and it makes no difference if there are trailing spaces
701 in the preceding line.
702
703 @smallexample
704 @group
705 ---------- Buffer: foo ----------
706 When in the course of human
707 @point{} events, it becomes necessary
708 ---------- Buffer: foo ----------
709 @end group
710
711 (delete-indentation)
712 @result{} nil
713
714 @group
715 ---------- Buffer: foo ----------
716 When in the course of human@point{} events, it becomes necessary
717 ---------- Buffer: foo ----------
718 @end group
719 @end smallexample
720
721 After the lines are joined, the function @code{fixup-whitespace} is
722 responsible for deciding whether to leave a space at the junction.
723 @end deffn
724
725 @deffn Command fixup-whitespace
726 This function replaces all the horizontal whitespace surrounding point
727 with either one space or no space, according to the context. It
728 returns @code{nil}.
729
730 At the beginning or end of a line, the appropriate amount of space is
731 none. Before a character with close parenthesis syntax, or after a
732 character with open parenthesis or expression-prefix syntax, no space is
733 also appropriate. Otherwise, one space is appropriate. @xref{Syntax
734 Class Table}.
735
736 In the example below, @code{fixup-whitespace} is called the first time
737 with point before the word @samp{spaces} in the first line. For the
738 second invocation, point is directly after the @samp{(}.
739
740 @smallexample
741 @group
742 ---------- Buffer: foo ----------
743 This has too many @point{}spaces
744 This has too many spaces at the start of (@point{} this list)
745 ---------- Buffer: foo ----------
746 @end group
747
748 @group
749 (fixup-whitespace)
750 @result{} nil
751 (fixup-whitespace)
752 @result{} nil
753 @end group
754
755 @group
756 ---------- Buffer: foo ----------
757 This has too many spaces
758 This has too many spaces at the start of (this list)
759 ---------- Buffer: foo ----------
760 @end group
761 @end smallexample
762 @end deffn
763
764 @deffn Command just-one-space &optional n
765 @comment !!SourceFile simple.el
766 This command replaces any spaces and tabs around point with a single
767 space, or @var{n} spaces if @var{n} is specified. It returns
768 @code{nil}.
769 @end deffn
770
771 @deffn Command delete-blank-lines
772 This function deletes blank lines surrounding point. If point is on a
773 blank line with one or more blank lines before or after it, then all but
774 one of them are deleted. If point is on an isolated blank line, then it
775 is deleted. If point is on a nonblank line, the command deletes all
776 blank lines immediately following it.
777
778 A blank line is defined as a line containing only tabs and spaces.
779
780 @code{delete-blank-lines} returns @code{nil}.
781 @end deffn
782
783 @node The Kill Ring
784 @section The Kill Ring
785 @cindex kill ring
786
787 @dfn{Kill functions} delete text like the deletion functions, but save
788 it so that the user can reinsert it by @dfn{yanking}. Most of these
789 functions have @samp{kill-} in their name. By contrast, the functions
790 whose names start with @samp{delete-} normally do not save text for
791 yanking (though they can still be undone); these are ``deletion''
792 functions.
793
794 Most of the kill commands are primarily for interactive use, and are
795 not described here. What we do describe are the functions provided for
796 use in writing such commands. You can use these functions to write
797 commands for killing text. When you need to delete text for internal
798 purposes within a Lisp function, you should normally use deletion
799 functions, so as not to disturb the kill ring contents.
800 @xref{Deletion}.
801
802 Killed text is saved for later yanking in the @dfn{kill ring}. This
803 is a list that holds a number of recent kills, not just the last text
804 kill. We call this a ``ring'' because yanking treats it as having
805 elements in a cyclic order. The list is kept in the variable
806 @code{kill-ring}, and can be operated on with the usual functions for
807 lists; there are also specialized functions, described in this section,
808 that treat it as a ring.
809
810 Some people think this use of the word ``kill'' is unfortunate, since
811 it refers to operations that specifically @emph{do not} destroy the
812 entities ``killed.'' This is in sharp contrast to ordinary life, in
813 which death is permanent and ``killed'' entities do not come back to
814 life. Therefore, other metaphors have been proposed. For example, the
815 term ``cut ring'' makes sense to people who, in pre-computer days, used
816 scissors and paste to cut up and rearrange manuscripts. However, it
817 would be difficult to change the terminology now.
818
819 @menu
820 * Kill Ring Concepts:: What text looks like in the kill ring.
821 * Kill Functions:: Functions that kill text.
822 * Yanking:: How yanking is done.
823 * Yank Commands:: Commands that access the kill ring.
824 * Low-Level Kill Ring:: Functions and variables for kill ring access.
825 * Internals of Kill Ring:: Variables that hold kill ring data.
826 @end menu
827
828 @node Kill Ring Concepts
829 @comment node-name, next, previous, up
830 @subsection Kill Ring Concepts
831
832 The kill ring records killed text as strings in a list, most recent
833 first. A short kill ring, for example, might look like this:
834
835 @example
836 ("some text" "a different piece of text" "even older text")
837 @end example
838
839 @noindent
840 When the list reaches @code{kill-ring-max} entries in length, adding a
841 new entry automatically deletes the last entry.
842
843 When kill commands are interwoven with other commands, each kill
844 command makes a new entry in the kill ring. Multiple kill commands in
845 succession build up a single kill ring entry, which would be yanked as a
846 unit; the second and subsequent consecutive kill commands add text to
847 the entry made by the first one.
848
849 For yanking, one entry in the kill ring is designated the ``front'' of
850 the ring. Some yank commands ``rotate'' the ring by designating a
851 different element as the ``front.'' But this virtual rotation doesn't
852 change the list itself---the most recent entry always comes first in the
853 list.
854
855 @node Kill Functions
856 @comment node-name, next, previous, up
857 @subsection Functions for Killing
858
859 @code{kill-region} is the usual subroutine for killing text. Any
860 command that calls this function is a ``kill command'' (and should
861 probably have @samp{kill} in its name). @code{kill-region} puts the
862 newly killed text in a new element at the beginning of the kill ring or
863 adds it to the most recent element. It determines automatically (using
864 @code{last-command}) whether the previous command was a kill command,
865 and if so appends the killed text to the most recent entry.
866
867 @deffn Command kill-region start end
868 This function kills the text in the region defined by @var{start} and
869 @var{end}. The text is deleted but saved in the kill ring, along with
870 its text properties. The value is always @code{nil}.
871
872 In an interactive call, @var{start} and @var{end} are point and
873 the mark.
874
875 If the buffer or text is read-only, @code{kill-region} modifies the kill
876 ring just the same, then signals an error without modifying the buffer.
877 This is convenient because it lets the user use a series of kill
878 commands to copy text from a read-only buffer into the kill ring.
879 @end deffn
880
881 @defopt kill-read-only-ok
882 If this option is non-@code{nil}, @code{kill-region} does not signal an
883 error if the buffer or text is read-only. Instead, it simply returns,
884 updating the kill ring but not changing the buffer.
885 @end defopt
886
887 @deffn Command copy-region-as-kill start end
888 This command saves the region defined by @var{start} and @var{end} on
889 the kill ring (including text properties), but does not delete the text
890 from the buffer. It returns @code{nil}.
891
892 The command does not set @code{this-command} to @code{kill-region}, so a
893 subsequent kill command does not append to the same kill ring entry.
894
895 @c FIXME Why is it better? Why isn't copy-region-as-kill obsolete then?
896 @c Why is it used in many places in Emacs?
897 In Lisp programs, it is better to use @code{kill-new} or
898 @code{kill-append} instead of this command. @xref{Low-Level Kill Ring}.
899 @end deffn
900
901 @node Yanking
902 @subsection Yanking
903
904 Yanking means inserting text from the kill ring, but it does
905 not insert the text blindly. Yank commands and some other commands
906 use @code{insert-for-yank} to perform special processing on the
907 text that they copy into the buffer.
908
909 @defun insert-for-yank string
910 This function normally works like @code{insert} except that it doesn't
911 insert the text properties in the @code{yank-excluded-properties}
912 list. However, if any part of @var{string} has a non-@code{nil}
913 @code{yank-handler} text property, that property can do various
914 special processing on that part of the text being inserted.
915 @end defun
916
917 @defun insert-buffer-substring-as-yank buf &optional start end
918 This function resembles @code{insert-buffer-substring} except that it
919 doesn't insert the text properties in the
920 @code{yank-excluded-properties} list.
921 @end defun
922
923 You can put a @code{yank-handler} text property on all or part of
924 the text to control how it will be inserted if it is yanked. The
925 @code{insert-for-yank} function looks for that property. The property
926 value must be a list of one to four elements, with the following
927 format (where elements after the first may be omitted):
928
929 @example
930 (@var{function} @var{param} @var{noexclude} @var{undo})
931 @end example
932
933 Here is what the elements do:
934
935 @table @var
936 @item function
937 When @var{function} is present and non-@code{nil}, it is called instead of
938 @code{insert} to insert the string. @var{function} takes one
939 argument---the string to insert.
940
941 @item param
942 If @var{param} is present and non-@code{nil}, it replaces @var{string}
943 (or the part of @var{string} being processed) as the object passed to
944 @var{function} (or @code{insert}); for example, if @var{function} is
945 @code{yank-rectangle}, @var{param} should be a list of strings to
946 insert as a rectangle.
947
948 @item noexclude
949 If @var{noexclude} is present and non-@code{nil}, the normal removal of the
950 yank-excluded-properties is not performed; instead @var{function} is
951 responsible for removing those properties. This may be necessary
952 if @var{function} adjusts point before or after inserting the object.
953
954 @item undo
955 If @var{undo} is present and non-@code{nil}, it is a function that will be
956 called by @code{yank-pop} to undo the insertion of the current object.
957 It is called with two arguments, the start and end of the current
958 region. @var{function} can set @code{yank-undo-function} to override
959 the @var{undo} value.
960 @end table
961
962 @node Yank Commands
963 @comment node-name, next, previous, up
964 @subsection Functions for Yanking
965
966 This section describes higher-level commands for yanking, which are
967 intended primarily for the user but useful also in Lisp programs.
968 Both @code{yank} and @code{yank-pop} honor the
969 @code{yank-excluded-properties} variable and @code{yank-handler} text
970 property (@pxref{Yanking}).
971
972 @deffn Command yank &optional arg
973 @cindex inserting killed text
974 This command inserts before point the text at the front of the kill
975 ring. It sets the mark at the beginning of that text, using
976 @code{push-mark} (@pxref{The Mark}), and puts point at the end.
977
978 If @var{arg} is a non-@code{nil} list (which occurs interactively when
979 the user types @kbd{C-u} with no digits), then @code{yank} inserts the
980 text as described above, but puts point before the yanked text and
981 sets the mark after it.
982
983 If @var{arg} is a number, then @code{yank} inserts the @var{arg}th
984 most recently killed text---the @var{arg}th element of the kill ring
985 list, counted cyclically from the front, which is considered the
986 first element for this purpose.
987
988 @code{yank} does not alter the contents of the kill ring, unless it
989 used text provided by another program, in which case it pushes that text
990 onto the kill ring. However if @var{arg} is an integer different from
991 one, it rotates the kill ring to place the yanked string at the front.
992
993 @code{yank} returns @code{nil}.
994 @end deffn
995
996 @deffn Command yank-pop &optional arg
997 This command replaces the just-yanked entry from the kill ring with a
998 different entry from the kill ring.
999
1000 This is allowed only immediately after a @code{yank} or another
1001 @code{yank-pop}. At such a time, the region contains text that was just
1002 inserted by yanking. @code{yank-pop} deletes that text and inserts in
1003 its place a different piece of killed text. It does not add the deleted
1004 text to the kill ring, since it is already in the kill ring somewhere.
1005 It does however rotate the kill ring to place the newly yanked string at
1006 the front.
1007
1008 If @var{arg} is @code{nil}, then the replacement text is the previous
1009 element of the kill ring. If @var{arg} is numeric, the replacement is
1010 the @var{arg}th previous kill. If @var{arg} is negative, a more recent
1011 kill is the replacement.
1012
1013 The sequence of kills in the kill ring wraps around, so that after the
1014 oldest one comes the newest one, and before the newest one goes the
1015 oldest.
1016
1017 The return value is always @code{nil}.
1018 @end deffn
1019
1020 @defvar yank-undo-function
1021 If this variable is non-@code{nil}, the function @code{yank-pop} uses
1022 its value instead of @code{delete-region} to delete the text
1023 inserted by the previous @code{yank} or
1024 @code{yank-pop} command. The value must be a function of two
1025 arguments, the start and end of the current region.
1026
1027 The function @code{insert-for-yank} automatically sets this variable
1028 according to the @var{undo} element of the @code{yank-handler}
1029 text property, if there is one.
1030 @end defvar
1031
1032 @node Low-Level Kill Ring
1033 @subsection Low-Level Kill Ring
1034
1035 These functions and variables provide access to the kill ring at a
1036 lower level, but are still convenient for use in Lisp programs,
1037 because they take care of interaction with window system selections
1038 (@pxref{Window System Selections}).
1039
1040 @defun current-kill n &optional do-not-move
1041 The function @code{current-kill} rotates the yanking pointer, which
1042 designates the ``front'' of the kill ring, by @var{n} places (from newer
1043 kills to older ones), and returns the text at that place in the ring.
1044
1045 If the optional second argument @var{do-not-move} is non-@code{nil},
1046 then @code{current-kill} doesn't alter the yanking pointer; it just
1047 returns the @var{n}th kill, counting from the current yanking pointer.
1048
1049 If @var{n} is zero, indicating a request for the latest kill,
1050 @code{current-kill} calls the value of
1051 @code{interprogram-paste-function} (documented below) before
1052 consulting the kill ring. If that value is a function and calling it
1053 returns a string or a list of several string, @code{current-kill}
1054 pushes the strings onto the kill ring and returns the first string.
1055 It also sets the yanking pointer to point to the kill-ring entry of
1056 the first string returned by @code{interprogram-paste-function},
1057 regardless of the value of @var{do-not-move}. Otherwise,
1058 @code{current-kill} does not treat a zero value for @var{n} specially:
1059 it returns the entry pointed at by the yanking pointer and does not
1060 move the yanking pointer.
1061 @end defun
1062
1063 @defun kill-new string &optional replace
1064 This function pushes the text @var{string} onto the kill ring and
1065 makes the yanking pointer point to it. It discards the oldest entry
1066 if appropriate. It also invokes the value of
1067 @code{interprogram-cut-function} (see below).
1068
1069 If @var{replace} is non-@code{nil}, then @code{kill-new} replaces the
1070 first element of the kill ring with @var{string}, rather than pushing
1071 @var{string} onto the kill ring.
1072 @end defun
1073
1074 @defun kill-append string before-p
1075 This function appends the text @var{string} to the first entry in the
1076 kill ring and makes the yanking pointer point to the combined entry.
1077 Normally @var{string} goes at the end of the entry, but if
1078 @var{before-p} is non-@code{nil}, it goes at the beginning. This
1079 function also invokes the value of @code{interprogram-cut-function}
1080 (see below).
1081 @end defun
1082
1083 @defvar interprogram-paste-function
1084 This variable provides a way of transferring killed text from other
1085 programs, when you are using a window system. Its value should be
1086 @code{nil} or a function of no arguments.
1087
1088 If the value is a function, @code{current-kill} calls it to get the
1089 ``most recent kill.'' If the function returns a non-@code{nil} value,
1090 then that value is used as the ``most recent kill.'' If it returns
1091 @code{nil}, then the front of the kill ring is used.
1092
1093 To facilitate support for window systems that support multiple
1094 selections, this function may also return a list of strings. In that
1095 case, the first string is used as the ``most recent kill'', and all
1096 the other strings are pushed onto the kill ring, for easy access by
1097 @code{yank-pop}.
1098
1099 The normal use of this function is to get the window system's primary
1100 selection as the most recent kill, even if the selection belongs to
1101 another application. @xref{Window System Selections}. However, if
1102 the selection was provided by the current Emacs session, this function
1103 should return @code{nil}. (If it is hard to tell whether Emacs or
1104 some other program provided the selection, it should be good enough to
1105 use @code{string=} to compare it with the last text Emacs provided.)
1106 @end defvar
1107
1108 @defvar interprogram-cut-function
1109 This variable provides a way of communicating killed text to other
1110 programs, when you are using a window system. Its value should be
1111 @code{nil} or a function of one required argument.
1112
1113 If the value is a function, @code{kill-new} and @code{kill-append} call
1114 it with the new first element of the kill ring as the argument.
1115
1116 The normal use of this function is to set the window system's primary
1117 selection from the newly killed text.
1118 @xref{Window System Selections}.
1119 @end defvar
1120
1121 @node Internals of Kill Ring
1122 @comment node-name, next, previous, up
1123 @subsection Internals of the Kill Ring
1124
1125 The variable @code{kill-ring} holds the kill ring contents, in the
1126 form of a list of strings. The most recent kill is always at the front
1127 of the list.
1128
1129 The @code{kill-ring-yank-pointer} variable points to a link in the
1130 kill ring list, whose @sc{car} is the text to yank next. We say it
1131 identifies the ``front'' of the ring. Moving
1132 @code{kill-ring-yank-pointer} to a different link is called
1133 @dfn{rotating the kill ring}. We call the kill ring a ``ring'' because
1134 the functions that move the yank pointer wrap around from the end of the
1135 list to the beginning, or vice-versa. Rotation of the kill ring is
1136 virtual; it does not change the value of @code{kill-ring}.
1137
1138 Both @code{kill-ring} and @code{kill-ring-yank-pointer} are Lisp
1139 variables whose values are normally lists. The word ``pointer'' in the
1140 name of the @code{kill-ring-yank-pointer} indicates that the variable's
1141 purpose is to identify one element of the list for use by the next yank
1142 command.
1143
1144 The value of @code{kill-ring-yank-pointer} is always @code{eq} to one
1145 of the links in the kill ring list. The element it identifies is the
1146 @sc{car} of that link. Kill commands, which change the kill ring, also
1147 set this variable to the value of @code{kill-ring}. The effect is to
1148 rotate the ring so that the newly killed text is at the front.
1149
1150 Here is a diagram that shows the variable @code{kill-ring-yank-pointer}
1151 pointing to the second entry in the kill ring @code{("some text" "a
1152 different piece of text" "yet older text")}.
1153
1154 @example
1155 @group
1156 kill-ring ---- kill-ring-yank-pointer
1157 | |
1158 | v
1159 | --- --- --- --- --- ---
1160 --> | | |------> | | |--> | | |--> nil
1161 --- --- --- --- --- ---
1162 | | |
1163 | | |
1164 | | -->"yet older text"
1165 | |
1166 | --> "a different piece of text"
1167 |
1168 --> "some text"
1169 @end group
1170 @end example
1171
1172 @noindent
1173 This state of affairs might occur after @kbd{C-y} (@code{yank})
1174 immediately followed by @kbd{M-y} (@code{yank-pop}).
1175
1176 @defvar kill-ring
1177 This variable holds the list of killed text sequences, most recently
1178 killed first.
1179 @end defvar
1180
1181 @defvar kill-ring-yank-pointer
1182 This variable's value indicates which element of the kill ring is at the
1183 ``front'' of the ring for yanking. More precisely, the value is a tail
1184 of the value of @code{kill-ring}, and its @sc{car} is the kill string
1185 that @kbd{C-y} should yank.
1186 @end defvar
1187
1188 @defopt kill-ring-max
1189 The value of this variable is the maximum length to which the kill
1190 ring can grow, before elements are thrown away at the end. The default
1191 value for @code{kill-ring-max} is 60.
1192 @end defopt
1193
1194 @node Undo
1195 @comment node-name, next, previous, up
1196 @section Undo
1197 @cindex redo
1198
1199 Most buffers have an @dfn{undo list}, which records all changes made
1200 to the buffer's text so that they can be undone. (The buffers that
1201 don't have one are usually special-purpose buffers for which Emacs
1202 assumes that undoing is not useful. In particular, any buffer whose
1203 name begins with a space has its undo recording off by default;
1204 see @ref{Buffer Names}.) All the primitives that modify the
1205 text in the buffer automatically add elements to the front of the undo
1206 list, which is in the variable @code{buffer-undo-list}.
1207
1208 @defvar buffer-undo-list
1209 This buffer-local variable's value is the undo list of the current
1210 buffer. A value of @code{t} disables the recording of undo information.
1211 @end defvar
1212
1213 Here are the kinds of elements an undo list can have:
1214
1215 @table @code
1216 @item @var{position}
1217 This kind of element records a previous value of point; undoing this
1218 element moves point to @var{position}. Ordinary cursor motion does not
1219 make any sort of undo record, but deletion operations use these entries
1220 to record where point was before the command.
1221
1222 @item (@var{beg} . @var{end})
1223 This kind of element indicates how to delete text that was inserted.
1224 Upon insertion, the text occupied the range @var{beg}--@var{end} in the
1225 buffer.
1226
1227 @item (@var{text} . @var{position})
1228 This kind of element indicates how to reinsert text that was deleted.
1229 The deleted text itself is the string @var{text}. The place to
1230 reinsert it is @code{(abs @var{position})}. If @var{position} is
1231 positive, point was at the beginning of the deleted text, otherwise it
1232 was at the end.
1233
1234 @item (t @var{high} . @var{low})
1235 This kind of element indicates that an unmodified buffer became
1236 modified. The elements @var{high} and @var{low} are two integers, each
1237 recording 16 bits of the visited file's modification time as of when it
1238 was previously visited or saved. @code{primitive-undo} uses those
1239 values to determine whether to mark the buffer as unmodified once again;
1240 it does so only if the file's modification time matches those numbers.
1241
1242 @item (nil @var{property} @var{value} @var{beg} . @var{end})
1243 This kind of element records a change in a text property.
1244 Here's how you might undo the change:
1245
1246 @example
1247 (put-text-property @var{beg} @var{end} @var{property} @var{value})
1248 @end example
1249
1250 @item (@var{marker} . @var{adjustment})
1251 This kind of element records the fact that the marker @var{marker} was
1252 relocated due to deletion of surrounding text, and that it moved
1253 @var{adjustment} character positions. Undoing this element moves
1254 @var{marker} @minus{} @var{adjustment} characters.
1255
1256 @item (apply @var{funname} . @var{args})
1257 This is an extensible undo item, which is undone by calling
1258 @var{funname} with arguments @var{args}.
1259
1260 @item (apply @var{delta} @var{beg} @var{end} @var{funname} . @var{args})
1261 This is an extensible undo item, which records a change limited to the
1262 range @var{beg} to @var{end}, which increased the size of the buffer
1263 by @var{delta}. It is undone by calling @var{funname} with arguments
1264 @var{args}.
1265
1266 This kind of element enables undo limited to a region to determine
1267 whether the element pertains to that region.
1268
1269 @item nil
1270 This element is a boundary. The elements between two boundaries are
1271 called a @dfn{change group}; normally, each change group corresponds to
1272 one keyboard command, and undo commands normally undo an entire group as
1273 a unit.
1274 @end table
1275
1276 @defun undo-boundary
1277 This function places a boundary element in the undo list. The undo
1278 command stops at such a boundary, and successive undo commands undo
1279 to earlier and earlier boundaries. This function returns @code{nil}.
1280
1281 The editor command loop automatically calls @code{undo-boundary} just
1282 before executing each key sequence, so that each undo normally undoes
1283 the effects of one command. As an exception, the command
1284 @code{self-insert-command}, which produces self-inserting input
1285 characters (@pxref{Commands for Insertion}), may remove the boundary
1286 inserted by the command loop: a boundary is accepted for the first
1287 such character, the next 19 consecutive self-inserting input
1288 characters do not have boundaries, and then the 20th does; and so on
1289 as long as the self-inserting characters continue. Hence, sequences
1290 of consecutive character insertions can be undone as a group.
1291
1292 All buffer modifications add a boundary whenever the previous undoable
1293 change was made in some other buffer. This is to ensure that
1294 each command makes a boundary in each buffer where it makes changes.
1295
1296 Calling this function explicitly is useful for splitting the effects of
1297 a command into more than one unit. For example, @code{query-replace}
1298 calls @code{undo-boundary} after each replacement, so that the user can
1299 undo individual replacements one by one.
1300 @end defun
1301
1302 @defvar undo-in-progress
1303 This variable is normally @code{nil}, but the undo commands bind it to
1304 @code{t}. This is so that various kinds of change hooks can tell when
1305 they're being called for the sake of undoing.
1306 @end defvar
1307
1308 @defun primitive-undo count list
1309 This is the basic function for undoing elements of an undo list.
1310 It undoes the first @var{count} elements of @var{list}, returning
1311 the rest of @var{list}.
1312
1313 @code{primitive-undo} adds elements to the buffer's undo list when it
1314 changes the buffer. Undo commands avoid confusion by saving the undo
1315 list value at the beginning of a sequence of undo operations. Then the
1316 undo operations use and update the saved value. The new elements added
1317 by undoing are not part of this saved value, so they don't interfere with
1318 continuing to undo.
1319
1320 This function does not bind @code{undo-in-progress}.
1321 @end defun
1322
1323 @node Maintaining Undo
1324 @section Maintaining Undo Lists
1325
1326 This section describes how to enable and disable undo information for
1327 a given buffer. It also explains how the undo list is truncated
1328 automatically so it doesn't get too big.
1329
1330 Recording of undo information in a newly created buffer is normally
1331 enabled to start with; but if the buffer name starts with a space, the
1332 undo recording is initially disabled. You can explicitly enable or
1333 disable undo recording with the following two functions, or by setting
1334 @code{buffer-undo-list} yourself.
1335
1336 @deffn Command buffer-enable-undo &optional buffer-or-name
1337 This command enables recording undo information for buffer
1338 @var{buffer-or-name}, so that subsequent changes can be undone. If no
1339 argument is supplied, then the current buffer is used. This function
1340 does nothing if undo recording is already enabled in the buffer. It
1341 returns @code{nil}.
1342
1343 In an interactive call, @var{buffer-or-name} is the current buffer.
1344 You cannot specify any other buffer.
1345 @end deffn
1346
1347 @deffn Command buffer-disable-undo &optional buffer-or-name
1348 @cindex disabling undo
1349 This function discards the undo list of @var{buffer-or-name}, and disables
1350 further recording of undo information. As a result, it is no longer
1351 possible to undo either previous changes or any subsequent changes. If
1352 the undo list of @var{buffer-or-name} is already disabled, this function
1353 has no effect.
1354
1355 This function returns @code{nil}.
1356 @end deffn
1357
1358 As editing continues, undo lists get longer and longer. To prevent
1359 them from using up all available memory space, garbage collection trims
1360 them back to size limits you can set. (For this purpose, the ``size''
1361 of an undo list measures the cons cells that make up the list, plus the
1362 strings of deleted text.) Three variables control the range of acceptable
1363 sizes: @code{undo-limit}, @code{undo-strong-limit} and
1364 @code{undo-outer-limit}. In these variables, size is counted as the
1365 number of bytes occupied, which includes both saved text and other
1366 data.
1367
1368 @defopt undo-limit
1369 This is the soft limit for the acceptable size of an undo list. The
1370 change group at which this size is exceeded is the last one kept.
1371 @end defopt
1372
1373 @defopt undo-strong-limit
1374 This is the upper limit for the acceptable size of an undo list. The
1375 change group at which this size is exceeded is discarded itself (along
1376 with all older change groups). There is one exception: the very latest
1377 change group is only discarded if it exceeds @code{undo-outer-limit}.
1378 @end defopt
1379
1380 @defopt undo-outer-limit
1381 If at garbage collection time the undo info for the current command
1382 exceeds this limit, Emacs discards the info and displays a warning.
1383 This is a last ditch limit to prevent memory overflow.
1384 @end defopt
1385
1386 @defopt undo-ask-before-discard
1387 If this variable is non-@code{nil}, when the undo info exceeds
1388 @code{undo-outer-limit}, Emacs asks in the echo area whether to
1389 discard the info. The default value is @code{nil}, which means to
1390 discard it automatically.
1391
1392 This option is mainly intended for debugging. Garbage collection is
1393 inhibited while the question is asked, which means that Emacs might
1394 leak memory if the user waits too long before answering the question.
1395 @end defopt
1396
1397 @node Filling
1398 @comment node-name, next, previous, up
1399 @section Filling
1400 @cindex filling text
1401
1402 @dfn{Filling} means adjusting the lengths of lines (by moving the line
1403 breaks) so that they are nearly (but no greater than) a specified
1404 maximum width. Additionally, lines can be @dfn{justified}, which means
1405 inserting spaces to make the left and/or right margins line up
1406 precisely. The width is controlled by the variable @code{fill-column}.
1407 For ease of reading, lines should be no longer than 70 or so columns.
1408
1409 You can use Auto Fill mode (@pxref{Auto Filling}) to fill text
1410 automatically as you insert it, but changes to existing text may leave
1411 it improperly filled. Then you must fill the text explicitly.
1412
1413 Most of the commands in this section return values that are not
1414 meaningful. All the functions that do filling take note of the current
1415 left margin, current right margin, and current justification style
1416 (@pxref{Margins}). If the current justification style is
1417 @code{none}, the filling functions don't actually do anything.
1418
1419 Several of the filling functions have an argument @var{justify}.
1420 If it is non-@code{nil}, that requests some kind of justification. It
1421 can be @code{left}, @code{right}, @code{full}, or @code{center}, to
1422 request a specific style of justification. If it is @code{t}, that
1423 means to use the current justification style for this part of the text
1424 (see @code{current-justification}, below). Any other value is treated
1425 as @code{full}.
1426
1427 When you call the filling functions interactively, using a prefix
1428 argument implies the value @code{full} for @var{justify}.
1429
1430 @deffn Command fill-paragraph &optional justify region
1431 This command fills the paragraph at or after point. If
1432 @var{justify} is non-@code{nil}, each line is justified as well.
1433 It uses the ordinary paragraph motion commands to find paragraph
1434 boundaries. @xref{Paragraphs,,, emacs, The GNU Emacs Manual}.
1435
1436 When @var{region} is non-@code{nil}, then if Transient Mark mode is
1437 enabled and the mark is active, this command calls @code{fill-region}
1438 to fill all the paragraphs in the region, instead of filling only the
1439 current paragraph. When this command is called interactively,
1440 @var{region} is @code{t}.
1441 @end deffn
1442
1443 @deffn Command fill-region start end &optional justify nosqueeze to-eop
1444 This command fills each of the paragraphs in the region from @var{start}
1445 to @var{end}. It justifies as well if @var{justify} is
1446 non-@code{nil}.
1447
1448 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1449 other than line breaks untouched. If @var{to-eop} is non-@code{nil},
1450 that means to keep filling to the end of the paragraph---or the next hard
1451 newline, if @code{use-hard-newlines} is enabled (see below).
1452
1453 The variable @code{paragraph-separate} controls how to distinguish
1454 paragraphs. @xref{Standard Regexps}.
1455 @end deffn
1456
1457 @deffn Command fill-individual-paragraphs start end &optional justify citation-regexp
1458 This command fills each paragraph in the region according to its
1459 individual fill prefix. Thus, if the lines of a paragraph were indented
1460 with spaces, the filled paragraph will remain indented in the same
1461 fashion.
1462
1463 The first two arguments, @var{start} and @var{end}, are the beginning
1464 and end of the region to be filled. The third and fourth arguments,
1465 @var{justify} and @var{citation-regexp}, are optional. If
1466 @var{justify} is non-@code{nil}, the paragraphs are justified as
1467 well as filled. If @var{citation-regexp} is non-@code{nil}, it means the
1468 function is operating on a mail message and therefore should not fill
1469 the header lines. If @var{citation-regexp} is a string, it is used as
1470 a regular expression; if it matches the beginning of a line, that line
1471 is treated as a citation marker.
1472
1473 Ordinarily, @code{fill-individual-paragraphs} regards each change in
1474 indentation as starting a new paragraph. If
1475 @code{fill-individual-varying-indent} is non-@code{nil}, then only
1476 separator lines separate paragraphs. That mode can handle indented
1477 paragraphs with additional indentation on the first line.
1478 @end deffn
1479
1480 @defopt fill-individual-varying-indent
1481 This variable alters the action of @code{fill-individual-paragraphs} as
1482 described above.
1483 @end defopt
1484
1485 @deffn Command fill-region-as-paragraph start end &optional justify nosqueeze squeeze-after
1486 This command considers a region of text as a single paragraph and fills
1487 it. If the region was made up of many paragraphs, the blank lines
1488 between paragraphs are removed. This function justifies as well as
1489 filling when @var{justify} is non-@code{nil}.
1490
1491 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1492 other than line breaks untouched. If @var{squeeze-after} is
1493 non-@code{nil}, it specifies a position in the region, and means don't
1494 canonicalize spaces before that position.
1495
1496 In Adaptive Fill mode, this command calls @code{fill-context-prefix} to
1497 choose a fill prefix by default. @xref{Adaptive Fill}.
1498 @end deffn
1499
1500 @deffn Command justify-current-line &optional how eop nosqueeze
1501 This command inserts spaces between the words of the current line so
1502 that the line ends exactly at @code{fill-column}. It returns
1503 @code{nil}.
1504
1505 The argument @var{how}, if non-@code{nil} specifies explicitly the style
1506 of justification. It can be @code{left}, @code{right}, @code{full},
1507 @code{center}, or @code{none}. If it is @code{t}, that means to do
1508 follow specified justification style (see @code{current-justification},
1509 below). @code{nil} means to do full justification.
1510
1511 If @var{eop} is non-@code{nil}, that means do only left-justification
1512 if @code{current-justification} specifies full justification. This is
1513 used for the last line of a paragraph; even if the paragraph as a
1514 whole is fully justified, the last line should not be.
1515
1516 If @var{nosqueeze} is non-@code{nil}, that means do not change interior
1517 whitespace.
1518 @end deffn
1519
1520 @defopt default-justification
1521 This variable's value specifies the style of justification to use for
1522 text that doesn't specify a style with a text property. The possible
1523 values are @code{left}, @code{right}, @code{full}, @code{center}, or
1524 @code{none}. The default value is @code{left}.
1525 @end defopt
1526
1527 @defun current-justification
1528 This function returns the proper justification style to use for filling
1529 the text around point.
1530
1531 This returns the value of the @code{justification} text property at
1532 point, or the variable @var{default-justification} if there is no such
1533 text property. However, it returns @code{nil} rather than @code{none}
1534 to mean ``don't justify''.
1535 @end defun
1536
1537 @defopt sentence-end-double-space
1538 @anchor{Definition of sentence-end-double-space}
1539 If this variable is non-@code{nil}, a period followed by just one space
1540 does not count as the end of a sentence, and the filling functions
1541 avoid breaking the line at such a place.
1542 @end defopt
1543
1544 @defopt sentence-end-without-period
1545 If this variable is non-@code{nil}, a sentence can end without a
1546 period. This is used for languages like Thai, where sentences end
1547 with a double space but without a period.
1548 @end defopt
1549
1550 @defopt sentence-end-without-space
1551 If this variable is non-@code{nil}, it should be a string of
1552 characters that can end a sentence without following spaces.
1553 @end defopt
1554
1555 @defvar fill-paragraph-function
1556 This variable provides a way to override the filling of paragraphs.
1557 If its value is non-@code{nil}, @code{fill-paragraph} calls this
1558 function to do the work. If the function returns a non-@code{nil}
1559 value, @code{fill-paragraph} assumes the job is done, and immediately
1560 returns that value.
1561
1562 The usual use of this feature is to fill comments in programming
1563 language modes. If the function needs to fill a paragraph in the usual
1564 way, it can do so as follows:
1565
1566 @example
1567 (let ((fill-paragraph-function nil))
1568 (fill-paragraph arg))
1569 @end example
1570 @end defvar
1571
1572 @defvar fill-forward-paragraph-function
1573 This variable provides a way to override how the filling functions,
1574 such as @code{fill-region} and @code{fill-paragraph}, move forward to
1575 the next paragraph. Its value should be a function, which is called
1576 with a single argument @var{n}, the number of paragraphs to move, and
1577 should return the difference between @var{n} and the number of
1578 paragraphs actually moved. The default value of this variable is
1579 @code{forward-paragraph}. @xref{Paragraphs,,, emacs, The GNU Emacs
1580 Manual}.
1581 @end defvar
1582
1583 @defvar use-hard-newlines
1584 If this variable is non-@code{nil}, the filling functions do not delete
1585 newlines that have the @code{hard} text property. These ``hard
1586 newlines'' act as paragraph separators.
1587 @end defvar
1588
1589 @node Margins
1590 @section Margins for Filling
1591
1592 @defopt fill-prefix
1593 This buffer-local variable, if non-@code{nil}, specifies a string of
1594 text that appears at the beginning of normal text lines and should be
1595 disregarded when filling them. Any line that fails to start with the
1596 fill prefix is considered the start of a paragraph; so is any line
1597 that starts with the fill prefix followed by additional whitespace.
1598 Lines that start with the fill prefix but no additional whitespace are
1599 ordinary text lines that can be filled together. The resulting filled
1600 lines also start with the fill prefix.
1601
1602 The fill prefix follows the left margin whitespace, if any.
1603 @end defopt
1604
1605 @defopt fill-column
1606 This buffer-local variable specifies the maximum width of filled lines.
1607 Its value should be an integer, which is a number of columns. All the
1608 filling, justification, and centering commands are affected by this
1609 variable, including Auto Fill mode (@pxref{Auto Filling}).
1610
1611 As a practical matter, if you are writing text for other people to
1612 read, you should set @code{fill-column} to no more than 70. Otherwise
1613 the line will be too long for people to read comfortably, and this can
1614 make the text seem clumsy.
1615
1616 The default value for @code{fill-column} is 70.
1617 @end defopt
1618
1619 @deffn Command set-left-margin from to margin
1620 This sets the @code{left-margin} property on the text from @var{from} to
1621 @var{to} to the value @var{margin}. If Auto Fill mode is enabled, this
1622 command also refills the region to fit the new margin.
1623 @end deffn
1624
1625 @deffn Command set-right-margin from to margin
1626 This sets the @code{right-margin} property on the text from @var{from}
1627 to @var{to} to the value @var{margin}. If Auto Fill mode is enabled,
1628 this command also refills the region to fit the new margin.
1629 @end deffn
1630
1631 @defun current-left-margin
1632 This function returns the proper left margin value to use for filling
1633 the text around point. The value is the sum of the @code{left-margin}
1634 property of the character at the start of the current line (or zero if
1635 none), and the value of the variable @code{left-margin}.
1636 @end defun
1637
1638 @defun current-fill-column
1639 This function returns the proper fill column value to use for filling
1640 the text around point. The value is the value of the @code{fill-column}
1641 variable, minus the value of the @code{right-margin} property of the
1642 character after point.
1643 @end defun
1644
1645 @deffn Command move-to-left-margin &optional n force
1646 This function moves point to the left margin of the current line. The
1647 column moved to is determined by calling the function
1648 @code{current-left-margin}. If the argument @var{n} is non-@code{nil},
1649 @code{move-to-left-margin} moves forward @var{n}@minus{}1 lines first.
1650
1651 If @var{force} is non-@code{nil}, that says to fix the line's
1652 indentation if that doesn't match the left margin value.
1653 @end deffn
1654
1655 @defun delete-to-left-margin &optional from to
1656 This function removes left margin indentation from the text between
1657 @var{from} and @var{to}. The amount of indentation to delete is
1658 determined by calling @code{current-left-margin}. In no case does this
1659 function delete non-whitespace. If @var{from} and @var{to} are omitted,
1660 they default to the whole buffer.
1661 @end defun
1662
1663 @defun indent-to-left-margin
1664 This function adjusts the indentation at the beginning of the current
1665 line to the value specified by the variable @code{left-margin}. (That
1666 may involve either inserting or deleting whitespace.) This function
1667 is value of @code{indent-line-function} in Paragraph-Indent Text mode.
1668 @end defun
1669
1670 @defopt left-margin
1671 This variable specifies the base left margin column. In Fundamental
1672 mode, @kbd{C-j} indents to this column. This variable automatically
1673 becomes buffer-local when set in any fashion.
1674 @end defopt
1675
1676 @defopt fill-nobreak-predicate
1677 This variable gives major modes a way to specify not to break a line
1678 at certain places. Its value should be a list of functions. Whenever
1679 filling considers breaking the line at a certain place in the buffer,
1680 it calls each of these functions with no arguments and with point
1681 located at that place. If any of the functions returns
1682 non-@code{nil}, then the line won't be broken there.
1683 @end defopt
1684
1685 @node Adaptive Fill
1686 @section Adaptive Fill Mode
1687 @c @cindex Adaptive Fill mode "adaptive-fill-mode" is adjacent.
1688
1689 When @dfn{Adaptive Fill Mode} is enabled, Emacs determines the fill
1690 prefix automatically from the text in each paragraph being filled
1691 rather than using a predetermined value. During filling, this fill
1692 prefix gets inserted at the start of the second and subsequent lines
1693 of the paragraph as described in @ref{Filling}, and in @ref{Auto
1694 Filling}.
1695
1696 @defopt adaptive-fill-mode
1697 Adaptive Fill mode is enabled when this variable is non-@code{nil}.
1698 It is @code{t} by default.
1699 @end defopt
1700
1701 @defun fill-context-prefix from to
1702 This function implements the heart of Adaptive Fill mode; it chooses a
1703 fill prefix based on the text between @var{from} and @var{to},
1704 typically the start and end of a paragraph. It does this by looking
1705 at the first two lines of the paragraph, based on the variables
1706 described below.
1707 @c The optional argument first-line-regexp is not documented
1708 @c because it exists for internal purposes and might be eliminated
1709 @c in the future.
1710
1711 Usually, this function returns the fill prefix, a string. However,
1712 before doing this, the function makes a final check (not specially
1713 mentioned in the following) that a line starting with this prefix
1714 wouldn't look like the start of a paragraph. Should this happen, the
1715 function signals the anomaly by returning @code{nil} instead.
1716
1717 In detail, @code{fill-context-prefix} does this:
1718
1719 @enumerate
1720 @item
1721 It takes a candidate for the fill prefix from the first line---it
1722 tries first the function in @code{adaptive-fill-function} (if any),
1723 then the regular expression @code{adaptive-fill-regexp} (see below).
1724 The first non-@code{nil} result of these, or the empty string if
1725 they're both @code{nil}, becomes the first line's candidate.
1726 @item
1727 If the paragraph has as yet only one line, the function tests the
1728 validity of the prefix candidate just found. The function then
1729 returns the candidate if it's valid, or a string of spaces otherwise.
1730 (see the description of @code{adaptive-fill-first-line-regexp} below).
1731 @item
1732 When the paragraph already has two lines, the function next looks for
1733 a prefix candidate on the second line, in just the same way it did for
1734 the first line. If it doesn't find one, it returns @code{nil}.
1735 @item
1736 The function now compares the two candidate prefixes heuristically: if
1737 the non-whitespace characters in the line 2 candidate occur in the
1738 same order in the line 1 candidate, the function returns the line 2
1739 candidate. Otherwise, it returns the largest initial substring which
1740 is common to both candidates (which might be the empty string).
1741 @end enumerate
1742 @end defun
1743
1744 @defopt adaptive-fill-regexp
1745 Adaptive Fill mode matches this regular expression against the text
1746 starting after the left margin whitespace (if any) on a line; the
1747 characters it matches are that line's candidate for the fill prefix.
1748
1749 The default value matches whitespace with certain punctuation
1750 characters intermingled.
1751 @end defopt
1752
1753 @defopt adaptive-fill-first-line-regexp
1754 Used only in one-line paragraphs, this regular expression acts as an
1755 additional check of the validity of the one available candidate fill
1756 prefix: the candidate must match this regular expression, or match
1757 @code{comment-start-skip}. If it doesn't, @code{fill-context-prefix}
1758 replaces the candidate with a string of spaces ``of the same width''
1759 as it.
1760
1761 The default value of this variable is @w{@code{"\\`[ \t]*\\'"}}, which
1762 matches only a string of whitespace. The effect of this default is to
1763 force the fill prefixes found in one-line paragraphs always to be pure
1764 whitespace.
1765 @end defopt
1766
1767 @defopt adaptive-fill-function
1768 You can specify more complex ways of choosing a fill prefix
1769 automatically by setting this variable to a function. The function is
1770 called with point after the left margin (if any) of a line, and it
1771 must preserve point. It should return either ``that line's'' fill
1772 prefix or @code{nil}, meaning it has failed to determine a prefix.
1773 @end defopt
1774
1775 @node Auto Filling
1776 @comment node-name, next, previous, up
1777 @section Auto Filling
1778 @cindex filling, automatic
1779 @cindex Auto Fill mode
1780
1781 Auto Fill mode is a minor mode that fills lines automatically as text
1782 is inserted. This section describes the hook used by Auto Fill mode.
1783 For a description of functions that you can call explicitly to fill and
1784 justify existing text, see @ref{Filling}.
1785
1786 Auto Fill mode also enables the functions that change the margins and
1787 justification style to refill portions of the text. @xref{Margins}.
1788
1789 @defvar auto-fill-function
1790 The value of this buffer-local variable should be a function (of no
1791 arguments) to be called after self-inserting a character from the table
1792 @code{auto-fill-chars}. It may be @code{nil}, in which case nothing
1793 special is done in that case.
1794
1795 The value of @code{auto-fill-function} is @code{do-auto-fill} when
1796 Auto-Fill mode is enabled. That is a function whose sole purpose is to
1797 implement the usual strategy for breaking a line.
1798
1799 @quotation
1800 In older Emacs versions, this variable was named @code{auto-fill-hook},
1801 but since it is not called with the standard convention for hooks, it
1802 was renamed to @code{auto-fill-function} in version 19.
1803 @end quotation
1804 @end defvar
1805
1806 @defvar normal-auto-fill-function
1807 This variable specifies the function to use for
1808 @code{auto-fill-function}, if and when Auto Fill is turned on. Major
1809 modes can set buffer-local values for this variable to alter how Auto
1810 Fill works.
1811 @end defvar
1812
1813 @defvar auto-fill-chars
1814 A char table of characters which invoke @code{auto-fill-function} when
1815 self-inserted---space and newline in most language environments. They
1816 have an entry @code{t} in the table.
1817 @end defvar
1818
1819 @node Sorting
1820 @section Sorting Text
1821 @cindex sorting text
1822
1823 The sorting functions described in this section all rearrange text in
1824 a buffer. This is in contrast to the function @code{sort}, which
1825 rearranges the order of the elements of a list (@pxref{Rearrangement}).
1826 The values returned by these functions are not meaningful.
1827
1828 @defun sort-subr reverse nextrecfun endrecfun &optional startkeyfun endkeyfun predicate
1829 This function is the general text-sorting routine that subdivides a
1830 buffer into records and then sorts them. Most of the commands in this
1831 section use this function.
1832
1833 To understand how @code{sort-subr} works, consider the whole accessible
1834 portion of the buffer as being divided into disjoint pieces called
1835 @dfn{sort records}. The records may or may not be contiguous, but they
1836 must not overlap. A portion of each sort record (perhaps all of it) is
1837 designated as the sort key. Sorting rearranges the records in order by
1838 their sort keys.
1839
1840 Usually, the records are rearranged in order of ascending sort key.
1841 If the first argument to the @code{sort-subr} function, @var{reverse},
1842 is non-@code{nil}, the sort records are rearranged in order of
1843 descending sort key.
1844
1845 The next four arguments to @code{sort-subr} are functions that are
1846 called to move point across a sort record. They are called many times
1847 from within @code{sort-subr}.
1848
1849 @enumerate
1850 @item
1851 @var{nextrecfun} is called with point at the end of a record. This
1852 function moves point to the start of the next record. The first record
1853 is assumed to start at the position of point when @code{sort-subr} is
1854 called. Therefore, you should usually move point to the beginning of
1855 the buffer before calling @code{sort-subr}.
1856
1857 This function can indicate there are no more sort records by leaving
1858 point at the end of the buffer.
1859
1860 @item
1861 @var{endrecfun} is called with point within a record. It moves point to
1862 the end of the record.
1863
1864 @item
1865 @var{startkeyfun} is called to move point from the start of a record to
1866 the start of the sort key. This argument is optional; if it is omitted,
1867 the whole record is the sort key. If supplied, the function should
1868 either return a non-@code{nil} value to be used as the sort key, or
1869 return @code{nil} to indicate that the sort key is in the buffer
1870 starting at point. In the latter case, @var{endkeyfun} is called to
1871 find the end of the sort key.
1872
1873 @item
1874 @var{endkeyfun} is called to move point from the start of the sort key
1875 to the end of the sort key. This argument is optional. If
1876 @var{startkeyfun} returns @code{nil} and this argument is omitted (or
1877 @code{nil}), then the sort key extends to the end of the record. There
1878 is no need for @var{endkeyfun} if @var{startkeyfun} returns a
1879 non-@code{nil} value.
1880 @end enumerate
1881
1882 The argument @var{predicate} is the function to use to compare keys.
1883 If keys are numbers, it defaults to @code{<}; otherwise it defaults to
1884 @code{string<}.
1885
1886 As an example of @code{sort-subr}, here is the complete function
1887 definition for @code{sort-lines}:
1888
1889 @example
1890 @group
1891 ;; @r{Note that the first two lines of doc string}
1892 ;; @r{are effectively one line when viewed by a user.}
1893 (defun sort-lines (reverse beg end)
1894 "Sort lines in region alphabetically;\
1895 argument means descending order.
1896 Called from a program, there are three arguments:
1897 @end group
1898 @group
1899 REVERSE (non-nil means reverse order),\
1900 BEG and END (region to sort).
1901 The variable `sort-fold-case' determines\
1902 whether alphabetic case affects
1903 the sort order."
1904 @end group
1905 @group
1906 (interactive "P\nr")
1907 (save-excursion
1908 (save-restriction
1909 (narrow-to-region beg end)
1910 (goto-char (point-min))
1911 (let ((inhibit-field-text-motion t))
1912 (sort-subr reverse 'forward-line 'end-of-line)))))
1913 @end group
1914 @end example
1915
1916 Here @code{forward-line} moves point to the start of the next record,
1917 and @code{end-of-line} moves point to the end of record. We do not pass
1918 the arguments @var{startkeyfun} and @var{endkeyfun}, because the entire
1919 record is used as the sort key.
1920
1921 The @code{sort-paragraphs} function is very much the same, except that
1922 its @code{sort-subr} call looks like this:
1923
1924 @example
1925 @group
1926 (sort-subr reverse
1927 (function
1928 (lambda ()
1929 (while (and (not (eobp))
1930 (looking-at paragraph-separate))
1931 (forward-line 1))))
1932 'forward-paragraph)
1933 @end group
1934 @end example
1935
1936 Markers pointing into any sort records are left with no useful
1937 position after @code{sort-subr} returns.
1938 @end defun
1939
1940 @defopt sort-fold-case
1941 If this variable is non-@code{nil}, @code{sort-subr} and the other
1942 buffer sorting functions ignore case when comparing strings.
1943 @end defopt
1944
1945 @deffn Command sort-regexp-fields reverse record-regexp key-regexp start end
1946 This command sorts the region between @var{start} and @var{end}
1947 alphabetically as specified by @var{record-regexp} and @var{key-regexp}.
1948 If @var{reverse} is a negative integer, then sorting is in reverse
1949 order.
1950
1951 Alphabetical sorting means that two sort keys are compared by
1952 comparing the first characters of each, the second characters of each,
1953 and so on. If a mismatch is found, it means that the sort keys are
1954 unequal; the sort key whose character is less at the point of first
1955 mismatch is the lesser sort key. The individual characters are compared
1956 according to their numerical character codes in the Emacs character set.
1957
1958 The value of the @var{record-regexp} argument specifies how to divide
1959 the buffer into sort records. At the end of each record, a search is
1960 done for this regular expression, and the text that matches it is taken
1961 as the next record. For example, the regular expression @samp{^.+$},
1962 which matches lines with at least one character besides a newline, would
1963 make each such line into a sort record. @xref{Regular Expressions}, for
1964 a description of the syntax and meaning of regular expressions.
1965
1966 The value of the @var{key-regexp} argument specifies what part of each
1967 record is the sort key. The @var{key-regexp} could match the whole
1968 record, or only a part. In the latter case, the rest of the record has
1969 no effect on the sorted order of records, but it is carried along when
1970 the record moves to its new position.
1971
1972 The @var{key-regexp} argument can refer to the text matched by a
1973 subexpression of @var{record-regexp}, or it can be a regular expression
1974 on its own.
1975
1976 If @var{key-regexp} is:
1977
1978 @table @asis
1979 @item @samp{\@var{digit}}
1980 then the text matched by the @var{digit}th @samp{\(...\)} parenthesis
1981 grouping in @var{record-regexp} is the sort key.
1982
1983 @item @samp{\&}
1984 then the whole record is the sort key.
1985
1986 @item a regular expression
1987 then @code{sort-regexp-fields} searches for a match for the regular
1988 expression within the record. If such a match is found, it is the sort
1989 key. If there is no match for @var{key-regexp} within a record then
1990 that record is ignored, which means its position in the buffer is not
1991 changed. (The other records may move around it.)
1992 @end table
1993
1994 For example, if you plan to sort all the lines in the region by the
1995 first word on each line starting with the letter @samp{f}, you should
1996 set @var{record-regexp} to @samp{^.*$} and set @var{key-regexp} to
1997 @samp{\<f\w*\>}. The resulting expression looks like this:
1998
1999 @example
2000 @group
2001 (sort-regexp-fields nil "^.*$" "\\<f\\w*\\>"
2002 (region-beginning)
2003 (region-end))
2004 @end group
2005 @end example
2006
2007 If you call @code{sort-regexp-fields} interactively, it prompts for
2008 @var{record-regexp} and @var{key-regexp} in the minibuffer.
2009 @end deffn
2010
2011 @deffn Command sort-lines reverse start end
2012 This command alphabetically sorts lines in the region between
2013 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2014 is in reverse order.
2015 @end deffn
2016
2017 @deffn Command sort-paragraphs reverse start end
2018 This command alphabetically sorts paragraphs in the region between
2019 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2020 is in reverse order.
2021 @end deffn
2022
2023 @deffn Command sort-pages reverse start end
2024 This command alphabetically sorts pages in the region between
2025 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2026 is in reverse order.
2027 @end deffn
2028
2029 @deffn Command sort-fields field start end
2030 This command sorts lines in the region between @var{start} and
2031 @var{end}, comparing them alphabetically by the @var{field}th field
2032 of each line. Fields are separated by whitespace and numbered starting
2033 from 1. If @var{field} is negative, sorting is by the
2034 @w{@minus{}@var{field}th} field from the end of the line. This command
2035 is useful for sorting tables.
2036 @end deffn
2037
2038 @deffn Command sort-numeric-fields field start end
2039 This command sorts lines in the region between @var{start} and
2040 @var{end}, comparing them numerically by the @var{field}th field of
2041 each line. Fields are separated by whitespace and numbered starting
2042 from 1. The specified field must contain a number in each line of the
2043 region. Numbers starting with 0 are treated as octal, and numbers
2044 starting with @samp{0x} are treated as hexadecimal.
2045
2046 If @var{field} is negative, sorting is by the
2047 @w{@minus{}@var{field}th} field from the end of the line. This
2048 command is useful for sorting tables.
2049 @end deffn
2050
2051 @defopt sort-numeric-base
2052 This variable specifies the default radix for
2053 @code{sort-numeric-fields} to parse numbers.
2054 @end defopt
2055
2056 @deffn Command sort-columns reverse &optional beg end
2057 This command sorts the lines in the region between @var{beg} and
2058 @var{end}, comparing them alphabetically by a certain range of
2059 columns. The column positions of @var{beg} and @var{end} bound the
2060 range of columns to sort on.
2061
2062 If @var{reverse} is non-@code{nil}, the sort is in reverse order.
2063
2064 One unusual thing about this command is that the entire line
2065 containing position @var{beg}, and the entire line containing position
2066 @var{end}, are included in the region sorted.
2067
2068 Note that @code{sort-columns} rejects text that contains tabs, because
2069 tabs could be split across the specified columns. Use @kbd{M-x
2070 untabify} to convert tabs to spaces before sorting.
2071
2072 When possible, this command actually works by calling the @code{sort}
2073 utility program.
2074 @end deffn
2075
2076 @node Columns
2077 @comment node-name, next, previous, up
2078 @section Counting Columns
2079 @cindex columns
2080 @cindex counting columns
2081 @cindex horizontal position
2082
2083 The column functions convert between a character position (counting
2084 characters from the beginning of the buffer) and a column position
2085 (counting screen characters from the beginning of a line).
2086
2087 These functions count each character according to the number of
2088 columns it occupies on the screen. This means control characters count
2089 as occupying 2 or 4 columns, depending upon the value of
2090 @code{ctl-arrow}, and tabs count as occupying a number of columns that
2091 depends on the value of @code{tab-width} and on the column where the tab
2092 begins. @xref{Usual Display}.
2093
2094 Column number computations ignore the width of the window and the
2095 amount of horizontal scrolling. Consequently, a column value can be
2096 arbitrarily high. The first (or leftmost) column is numbered 0. They
2097 also ignore overlays and text properties, aside from invisibility.
2098
2099 @defun current-column
2100 This function returns the horizontal position of point, measured in
2101 columns, counting from 0 at the left margin. The column position is the
2102 sum of the widths of all the displayed representations of the characters
2103 between the start of the current line and point.
2104
2105 For an example of using @code{current-column}, see the description of
2106 @code{count-lines} in @ref{Text Lines}.
2107 @end defun
2108
2109 @deffn Command move-to-column column &optional force
2110 This function moves point to @var{column} in the current line. The
2111 calculation of @var{column} takes into account the widths of the
2112 displayed representations of the characters between the start of the
2113 line and point.
2114
2115 When called interactively, @var{column} is the value of prefix numeric
2116 argument. If @var{column} is not an integer, an error is signaled.
2117
2118 If column @var{column} is beyond the end of the line, point moves to
2119 the end of the line. If @var{column} is negative, point moves to the
2120 beginning of the line.
2121
2122 If it is impossible to move to column @var{column} because that is in
2123 the middle of a multicolumn character such as a tab, point moves to the
2124 end of that character. However, if @var{force} is non-@code{nil}, and
2125 @var{column} is in the middle of a tab, then @code{move-to-column}
2126 converts the tab into spaces so that it can move precisely to column
2127 @var{column}. Other multicolumn characters can cause anomalies despite
2128 @var{force}, since there is no way to split them.
2129
2130 The argument @var{force} also has an effect if the line isn't long
2131 enough to reach column @var{column}; if it is @code{t}, that means to
2132 add whitespace at the end of the line to reach that column.
2133
2134 The return value is the column number actually moved to.
2135 @end deffn
2136
2137 @node Indentation
2138 @section Indentation
2139 @cindex indentation
2140
2141 The indentation functions are used to examine, move to, and change
2142 whitespace that is at the beginning of a line. Some of the functions
2143 can also change whitespace elsewhere on a line. Columns and indentation
2144 count from zero at the left margin.
2145
2146 @menu
2147 * Primitive Indent:: Functions used to count and insert indentation.
2148 * Mode-Specific Indent:: Customize indentation for different modes.
2149 * Region Indent:: Indent all the lines in a region.
2150 * Relative Indent:: Indent the current line based on previous lines.
2151 * Indent Tabs:: Adjustable, typewriter-like tab stops.
2152 * Motion by Indent:: Move to first non-blank character.
2153 @end menu
2154
2155 @node Primitive Indent
2156 @subsection Indentation Primitives
2157
2158 This section describes the primitive functions used to count and
2159 insert indentation. The functions in the following sections use these
2160 primitives. @xref{Width}, for related functions.
2161
2162 @defun current-indentation
2163 @comment !!Type Primitive Function
2164 @comment !!SourceFile indent.c
2165 This function returns the indentation of the current line, which is
2166 the horizontal position of the first nonblank character. If the
2167 contents are entirely blank, then this is the horizontal position of the
2168 end of the line.
2169 @end defun
2170
2171 @deffn Command indent-to column &optional minimum
2172 @comment !!Type Primitive Function
2173 @comment !!SourceFile indent.c
2174 This function indents from point with tabs and spaces until @var{column}
2175 is reached. If @var{minimum} is specified and non-@code{nil}, then at
2176 least that many spaces are inserted even if this requires going beyond
2177 @var{column}. Otherwise the function does nothing if point is already
2178 beyond @var{column}. The value is the column at which the inserted
2179 indentation ends.
2180
2181 The inserted whitespace characters inherit text properties from the
2182 surrounding text (usually, from the preceding text only). @xref{Sticky
2183 Properties}.
2184 @end deffn
2185
2186 @defopt indent-tabs-mode
2187 @comment !!SourceFile indent.c
2188 If this variable is non-@code{nil}, indentation functions can insert
2189 tabs as well as spaces. Otherwise, they insert only spaces. Setting
2190 this variable automatically makes it buffer-local in the current buffer.
2191 @end defopt
2192
2193 @node Mode-Specific Indent
2194 @subsection Indentation Controlled by Major Mode
2195
2196 An important function of each major mode is to customize the @key{TAB}
2197 key to indent properly for the language being edited. This section
2198 describes the mechanism of the @key{TAB} key and how to control it.
2199 The functions in this section return unpredictable values.
2200
2201 @defvar indent-line-function
2202 This variable's value is the function to be used by @key{TAB} (and
2203 various commands) to indent the current line. The command
2204 @code{indent-according-to-mode} does little more than call this function.
2205
2206 In Lisp mode, the value is the symbol @code{lisp-indent-line}; in C
2207 mode, @code{c-indent-line}; in Fortran mode, @code{fortran-indent-line}.
2208 The default value is @code{indent-relative}. @xref{Auto-Indentation}.
2209 @end defvar
2210
2211 @deffn Command indent-according-to-mode
2212 This command calls the function in @code{indent-line-function} to
2213 indent the current line in a way appropriate for the current major mode.
2214 @end deffn
2215
2216 @deffn Command indent-for-tab-command &optional rigid
2217 This command calls the function in @code{indent-line-function} to
2218 indent the current line; however, if that function is
2219 @code{indent-to-left-margin}, @code{insert-tab} is called instead.
2220 (That is a trivial command that inserts a tab character.) If
2221 @var{rigid} is non-@code{nil}, this function also rigidly indents the
2222 entire balanced expression that starts at the beginning of the current
2223 line, to reflect change in indentation of the current line.
2224 @end deffn
2225
2226 @deffn Command newline-and-indent
2227 This function inserts a newline, then indents the new line (the one
2228 following the newline just inserted) according to the major mode.
2229
2230 It does indentation by calling the current @code{indent-line-function}.
2231 In programming language modes, this is the same thing @key{TAB} does,
2232 but in some text modes, where @key{TAB} inserts a tab,
2233 @code{newline-and-indent} indents to the column specified by
2234 @code{left-margin}.
2235 @end deffn
2236
2237 @deffn Command reindent-then-newline-and-indent
2238 @comment !!SourceFile simple.el
2239 This command reindents the current line, inserts a newline at point,
2240 and then indents the new line (the one following the newline just
2241 inserted).
2242
2243 This command does indentation on both lines according to the current
2244 major mode, by calling the current value of @code{indent-line-function}.
2245 In programming language modes, this is the same thing @key{TAB} does,
2246 but in some text modes, where @key{TAB} inserts a tab,
2247 @code{reindent-then-newline-and-indent} indents to the column specified
2248 by @code{left-margin}.
2249 @end deffn
2250
2251 @node Region Indent
2252 @subsection Indenting an Entire Region
2253
2254 This section describes commands that indent all the lines in the
2255 region. They return unpredictable values.
2256
2257 @deffn Command indent-region start end &optional to-column
2258 This command indents each nonblank line starting between @var{start}
2259 (inclusive) and @var{end} (exclusive). If @var{to-column} is
2260 @code{nil}, @code{indent-region} indents each nonblank line by calling
2261 the current mode's indentation function, the value of
2262 @code{indent-line-function}.
2263
2264 If @var{to-column} is non-@code{nil}, it should be an integer
2265 specifying the number of columns of indentation; then this function
2266 gives each line exactly that much indentation, by either adding or
2267 deleting whitespace.
2268
2269 If there is a fill prefix, @code{indent-region} indents each line
2270 by making it start with the fill prefix.
2271 @end deffn
2272
2273 @defvar indent-region-function
2274 The value of this variable is a function that can be used by
2275 @code{indent-region} as a short cut. It should take two arguments, the
2276 start and end of the region. You should design the function so
2277 that it will produce the same results as indenting the lines of the
2278 region one by one, but presumably faster.
2279
2280 If the value is @code{nil}, there is no short cut, and
2281 @code{indent-region} actually works line by line.
2282
2283 A short-cut function is useful in modes such as C mode and Lisp mode,
2284 where the @code{indent-line-function} must scan from the beginning of
2285 the function definition: applying it to each line would be quadratic in
2286 time. The short cut can update the scan information as it moves through
2287 the lines indenting them; this takes linear time. In a mode where
2288 indenting a line individually is fast, there is no need for a short cut.
2289
2290 @code{indent-region} with a non-@code{nil} argument @var{to-column} has
2291 a different meaning and does not use this variable.
2292 @end defvar
2293
2294 @deffn Command indent-rigidly start end count
2295 This command indents all lines starting between @var{start}
2296 (inclusive) and @var{end} (exclusive) sideways by @var{count} columns.
2297 This ``preserves the shape'' of the affected region, moving it as a
2298 rigid unit. Consequently, this command is useful not only for indenting
2299 regions of unindented text, but also for indenting regions of formatted
2300 code.
2301
2302 For example, if @var{count} is 3, this command adds 3 columns of
2303 indentation to each of the lines beginning in the region specified.
2304
2305 In Mail mode, @kbd{C-c C-y} (@code{mail-yank-original}) uses
2306 @code{indent-rigidly} to indent the text copied from the message being
2307 replied to.
2308 @end deffn
2309
2310 @deffn Command indent-code-rigidly start end columns &optional nochange-regexp
2311 This is like @code{indent-rigidly}, except that it doesn't alter lines
2312 that start within strings or comments.
2313
2314 In addition, it doesn't alter a line if @var{nochange-regexp} matches at
2315 the beginning of the line (if @var{nochange-regexp} is non-@code{nil}).
2316 @end deffn
2317
2318 @node Relative Indent
2319 @subsection Indentation Relative to Previous Lines
2320
2321 This section describes two commands that indent the current line
2322 based on the contents of previous lines.
2323
2324 @deffn Command indent-relative &optional unindented-ok
2325 This command inserts whitespace at point, extending to the same
2326 column as the next @dfn{indent point} of the previous nonblank line. An
2327 indent point is a non-whitespace character following whitespace. The
2328 next indent point is the first one at a column greater than the current
2329 column of point. For example, if point is underneath and to the left of
2330 the first non-blank character of a line of text, it moves to that column
2331 by inserting whitespace.
2332
2333 If the previous nonblank line has no next indent point (i.e., none at a
2334 great enough column position), @code{indent-relative} either does
2335 nothing (if @var{unindented-ok} is non-@code{nil}) or calls
2336 @code{tab-to-tab-stop}. Thus, if point is underneath and to the right
2337 of the last column of a short line of text, this command ordinarily
2338 moves point to the next tab stop by inserting whitespace.
2339
2340 The return value of @code{indent-relative} is unpredictable.
2341
2342 In the following example, point is at the beginning of the second
2343 line:
2344
2345 @example
2346 @group
2347 This line is indented twelve spaces.
2348 @point{}The quick brown fox jumped.
2349 @end group
2350 @end example
2351
2352 @noindent
2353 Evaluation of the expression @code{(indent-relative nil)} produces the
2354 following:
2355
2356 @example
2357 @group
2358 This line is indented twelve spaces.
2359 @point{}The quick brown fox jumped.
2360 @end group
2361 @end example
2362
2363 In this next example, point is between the @samp{m} and @samp{p} of
2364 @samp{jumped}:
2365
2366 @example
2367 @group
2368 This line is indented twelve spaces.
2369 The quick brown fox jum@point{}ped.
2370 @end group
2371 @end example
2372
2373 @noindent
2374 Evaluation of the expression @code{(indent-relative nil)} produces the
2375 following:
2376
2377 @example
2378 @group
2379 This line is indented twelve spaces.
2380 The quick brown fox jum @point{}ped.
2381 @end group
2382 @end example
2383 @end deffn
2384
2385 @deffn Command indent-relative-maybe
2386 @comment !!SourceFile indent.el
2387 This command indents the current line like the previous nonblank line,
2388 by calling @code{indent-relative} with @code{t} as the
2389 @var{unindented-ok} argument. The return value is unpredictable.
2390
2391 If the previous nonblank line has no indent points beyond the current
2392 column, this command does nothing.
2393 @end deffn
2394
2395 @node Indent Tabs
2396 @comment node-name, next, previous, up
2397 @subsection Adjustable ``Tab Stops''
2398 @cindex tabs stops for indentation
2399
2400 This section explains the mechanism for user-specified ``tab stops''
2401 and the mechanisms that use and set them. The name ``tab stops'' is
2402 used because the feature is similar to that of the tab stops on a
2403 typewriter. The feature works by inserting an appropriate number of
2404 spaces and tab characters to reach the next tab stop column; it does not
2405 affect the display of tab characters in the buffer (@pxref{Usual
2406 Display}). Note that the @key{TAB} character as input uses this tab
2407 stop feature only in a few major modes, such as Text mode.
2408 @xref{Tab Stops,,, emacs, The GNU Emacs Manual}.
2409
2410 @deffn Command tab-to-tab-stop
2411 This command inserts spaces or tabs before point, up to the next tab
2412 stop column defined by @code{tab-stop-list}. It searches the list for
2413 an element greater than the current column number, and uses that element
2414 as the column to indent to. It does nothing if no such element is
2415 found.
2416 @end deffn
2417
2418 @defopt tab-stop-list
2419 This variable is the list of tab stop columns used by
2420 @code{tab-to-tab-stops}. The elements should be integers in increasing
2421 order. The tab stop columns need not be evenly spaced.
2422
2423 Use @kbd{M-x edit-tab-stops} to edit the location of tab stops
2424 interactively.
2425 @end defopt
2426
2427 @node Motion by Indent
2428 @subsection Indentation-Based Motion Commands
2429
2430 These commands, primarily for interactive use, act based on the
2431 indentation in the text.
2432
2433 @deffn Command back-to-indentation
2434 @comment !!SourceFile simple.el
2435 This command moves point to the first non-whitespace character in the
2436 current line (which is the line in which point is located). It returns
2437 @code{nil}.
2438 @end deffn
2439
2440 @deffn Command backward-to-indentation &optional arg
2441 @comment !!SourceFile simple.el
2442 This command moves point backward @var{arg} lines and then to the
2443 first nonblank character on that line. It returns @code{nil}.
2444 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2445 @end deffn
2446
2447 @deffn Command forward-to-indentation &optional arg
2448 @comment !!SourceFile simple.el
2449 This command moves point forward @var{arg} lines and then to the first
2450 nonblank character on that line. It returns @code{nil}.
2451 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2452 @end deffn
2453
2454 @node Case Changes
2455 @comment node-name, next, previous, up
2456 @section Case Changes
2457 @cindex case conversion in buffers
2458
2459 The case change commands described here work on text in the current
2460 buffer. @xref{Case Conversion}, for case conversion functions that work
2461 on strings and characters. @xref{Case Tables}, for how to customize
2462 which characters are upper or lower case and how to convert them.
2463
2464 @deffn Command capitalize-region start end
2465 This function capitalizes all words in the region defined by
2466 @var{start} and @var{end}. To capitalize means to convert each word's
2467 first character to upper case and convert the rest of each word to lower
2468 case. The function returns @code{nil}.
2469
2470 If one end of the region is in the middle of a word, the part of the
2471 word within the region is treated as an entire word.
2472
2473 When @code{capitalize-region} is called interactively, @var{start} and
2474 @var{end} are point and the mark, with the smallest first.
2475
2476 @example
2477 @group
2478 ---------- Buffer: foo ----------
2479 This is the contents of the 5th foo.
2480 ---------- Buffer: foo ----------
2481 @end group
2482
2483 @group
2484 (capitalize-region 1 44)
2485 @result{} nil
2486
2487 ---------- Buffer: foo ----------
2488 This Is The Contents Of The 5th Foo.
2489 ---------- Buffer: foo ----------
2490 @end group
2491 @end example
2492 @end deffn
2493
2494 @deffn Command downcase-region start end
2495 This function converts all of the letters in the region defined by
2496 @var{start} and @var{end} to lower case. The function returns
2497 @code{nil}.
2498
2499 When @code{downcase-region} is called interactively, @var{start} and
2500 @var{end} are point and the mark, with the smallest first.
2501 @end deffn
2502
2503 @deffn Command upcase-region start end
2504 This function converts all of the letters in the region defined by
2505 @var{start} and @var{end} to upper case. The function returns
2506 @code{nil}.
2507
2508 When @code{upcase-region} is called interactively, @var{start} and
2509 @var{end} are point and the mark, with the smallest first.
2510 @end deffn
2511
2512 @deffn Command capitalize-word count
2513 This function capitalizes @var{count} words after point, moving point
2514 over as it does. To capitalize means to convert each word's first
2515 character to upper case and convert the rest of each word to lower case.
2516 If @var{count} is negative, the function capitalizes the
2517 @minus{}@var{count} previous words but does not move point. The value
2518 is @code{nil}.
2519
2520 If point is in the middle of a word, the part of the word before point
2521 is ignored when moving forward. The rest is treated as an entire word.
2522
2523 When @code{capitalize-word} is called interactively, @var{count} is
2524 set to the numeric prefix argument.
2525 @end deffn
2526
2527 @deffn Command downcase-word count
2528 This function converts the @var{count} words after point to all lower
2529 case, moving point over as it does. If @var{count} is negative, it
2530 converts the @minus{}@var{count} previous words but does not move point.
2531 The value is @code{nil}.
2532
2533 When @code{downcase-word} is called interactively, @var{count} is set
2534 to the numeric prefix argument.
2535 @end deffn
2536
2537 @deffn Command upcase-word count
2538 This function converts the @var{count} words after point to all upper
2539 case, moving point over as it does. If @var{count} is negative, it
2540 converts the @minus{}@var{count} previous words but does not move point.
2541 The value is @code{nil}.
2542
2543 When @code{upcase-word} is called interactively, @var{count} is set to
2544 the numeric prefix argument.
2545 @end deffn
2546
2547 @node Text Properties
2548 @section Text Properties
2549 @cindex text properties
2550 @cindex attributes of text
2551 @cindex properties of text
2552
2553 Each character position in a buffer or a string can have a @dfn{text
2554 property list}, much like the property list of a symbol (@pxref{Property
2555 Lists}). The properties belong to a particular character at a
2556 particular place, such as, the letter @samp{T} at the beginning of this
2557 sentence or the first @samp{o} in @samp{foo}---if the same character
2558 occurs in two different places, the two occurrences in general have
2559 different properties.
2560
2561 Each property has a name and a value. Both of these can be any Lisp
2562 object, but the name is normally a symbol. Typically each property
2563 name symbol is used for a particular purpose; for instance, the text
2564 property @code{face} specifies the faces for displaying the character
2565 (@pxref{Special Properties}). The usual way to access the property
2566 list is to specify a name and ask what value corresponds to it.
2567
2568 If a character has a @code{category} property, we call it the
2569 @dfn{property category} of the character. It should be a symbol. The
2570 properties of the symbol serve as defaults for the properties of the
2571 character.
2572
2573 Copying text between strings and buffers preserves the properties
2574 along with the characters; this includes such diverse functions as
2575 @code{substring}, @code{insert}, and @code{buffer-substring}.
2576
2577 @menu
2578 * Examining Properties:: Looking at the properties of one character.
2579 * Changing Properties:: Setting the properties of a range of text.
2580 * Property Search:: Searching for where a property changes value.
2581 * Special Properties:: Particular properties with special meanings.
2582 * Format Properties:: Properties for representing formatting of text.
2583 * Sticky Properties:: How inserted text gets properties from
2584 neighboring text.
2585 * Lazy Properties:: Computing text properties in a lazy fashion
2586 only when text is examined.
2587 * Clickable Text:: Using text properties to make regions of text
2588 do something when you click on them.
2589 * Fields:: The @code{field} property defines
2590 fields within the buffer.
2591 * Not Intervals:: Why text properties do not use
2592 Lisp-visible text intervals.
2593 @end menu
2594
2595 @node Examining Properties
2596 @subsection Examining Text Properties
2597
2598 The simplest way to examine text properties is to ask for the value of
2599 a particular property of a particular character. For that, use
2600 @code{get-text-property}. Use @code{text-properties-at} to get the
2601 entire property list of a character. @xref{Property Search}, for
2602 functions to examine the properties of a number of characters at once.
2603
2604 These functions handle both strings and buffers. Keep in mind that
2605 positions in a string start from 0, whereas positions in a buffer start
2606 from 1.
2607
2608 @defun get-text-property pos prop &optional object
2609 This function returns the value of the @var{prop} property of the
2610 character after position @var{pos} in @var{object} (a buffer or
2611 string). The argument @var{object} is optional and defaults to the
2612 current buffer.
2613
2614 If there is no @var{prop} property strictly speaking, but the character
2615 has a property category that is a symbol, then @code{get-text-property} returns
2616 the @var{prop} property of that symbol.
2617 @end defun
2618
2619 @defun get-char-property position prop &optional object
2620 This function is like @code{get-text-property}, except that it checks
2621 overlays first and then text properties. @xref{Overlays}.
2622
2623 The argument @var{object} may be a string, a buffer, or a window. If
2624 it is a window, then the buffer displayed in that window is used for
2625 text properties and overlays, but only the overlays active for that
2626 window are considered. If @var{object} is a buffer, then overlays in
2627 that buffer are considered first, in order of decreasing priority,
2628 followed by the text properties. If @var{object} is a string, only
2629 text properties are considered, since strings never have overlays.
2630 @end defun
2631
2632 @defun get-char-property-and-overlay position prop &optional object
2633 This is like @code{get-char-property}, but gives extra information
2634 about the overlay that the property value comes from.
2635
2636 Its value is a cons cell whose @sc{car} is the property value, the
2637 same value @code{get-char-property} would return with the same
2638 arguments. Its @sc{cdr} is the overlay in which the property was
2639 found, or @code{nil}, if it was found as a text property or not found
2640 at all.
2641
2642 If @var{position} is at the end of @var{object}, both the @sc{car} and
2643 the @sc{cdr} of the value are @code{nil}.
2644 @end defun
2645
2646 @defvar char-property-alias-alist
2647 This variable holds an alist which maps property names to a list of
2648 alternative property names. If a character does not specify a direct
2649 value for a property, the alternative property names are consulted in
2650 order; the first non-@code{nil} value is used. This variable takes
2651 precedence over @code{default-text-properties}, and @code{category}
2652 properties take precedence over this variable.
2653 @end defvar
2654
2655 @defun text-properties-at position &optional object
2656 This function returns the entire property list of the character at
2657 @var{position} in the string or buffer @var{object}. If @var{object} is
2658 @code{nil}, it defaults to the current buffer.
2659 @end defun
2660
2661 @defvar default-text-properties
2662 This variable holds a property list giving default values for text
2663 properties. Whenever a character does not specify a value for a
2664 property, neither directly, through a category symbol, or through
2665 @code{char-property-alias-alist}, the value stored in this list is
2666 used instead. Here is an example:
2667
2668 @example
2669 (setq default-text-properties '(foo 69)
2670 char-property-alias-alist nil)
2671 ;; @r{Make sure character 1 has no properties of its own.}
2672 (set-text-properties 1 2 nil)
2673 ;; @r{What we get, when we ask, is the default value.}
2674 (get-text-property 1 'foo)
2675 @result{} 69
2676 @end example
2677 @end defvar
2678
2679 @node Changing Properties
2680 @subsection Changing Text Properties
2681
2682 The primitives for changing properties apply to a specified range of
2683 text in a buffer or string. The function @code{set-text-properties}
2684 (see end of section) sets the entire property list of the text in that
2685 range; more often, it is useful to add, change, or delete just certain
2686 properties specified by name.
2687
2688 Since text properties are considered part of the contents of the
2689 buffer (or string), and can affect how a buffer looks on the screen,
2690 any change in buffer text properties marks the buffer as modified.
2691 Buffer text property changes are undoable also (@pxref{Undo}).
2692 Positions in a string start from 0, whereas positions in a buffer
2693 start from 1.
2694
2695 @defun put-text-property start end prop value &optional object
2696 This function sets the @var{prop} property to @var{value} for the text
2697 between @var{start} and @var{end} in the string or buffer @var{object}.
2698 If @var{object} is @code{nil}, it defaults to the current buffer.
2699 @end defun
2700
2701 @defun add-text-properties start end props &optional object
2702 This function adds or overrides text properties for the text between
2703 @var{start} and @var{end} in the string or buffer @var{object}. If
2704 @var{object} is @code{nil}, it defaults to the current buffer.
2705
2706 The argument @var{props} specifies which properties to add. It should
2707 have the form of a property list (@pxref{Property Lists}): a list whose
2708 elements include the property names followed alternately by the
2709 corresponding values.
2710
2711 The return value is @code{t} if the function actually changed some
2712 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2713 its values agree with those in the text).
2714
2715 For example, here is how to set the @code{comment} and @code{face}
2716 properties of a range of text:
2717
2718 @example
2719 (add-text-properties @var{start} @var{end}
2720 '(comment t face highlight))
2721 @end example
2722 @end defun
2723
2724 @defun remove-text-properties start end props &optional object
2725 This function deletes specified text properties from the text between
2726 @var{start} and @var{end} in the string or buffer @var{object}. If
2727 @var{object} is @code{nil}, it defaults to the current buffer.
2728
2729 The argument @var{props} specifies which properties to delete. It
2730 should have the form of a property list (@pxref{Property Lists}): a list
2731 whose elements are property names alternating with corresponding values.
2732 But only the names matter---the values that accompany them are ignored.
2733 For example, here's how to remove the @code{face} property.
2734
2735 @example
2736 (remove-text-properties @var{start} @var{end} '(face nil))
2737 @end example
2738
2739 The return value is @code{t} if the function actually changed some
2740 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2741 if no character in the specified text had any of those properties).
2742
2743 To remove all text properties from certain text, use
2744 @code{set-text-properties} and specify @code{nil} for the new property
2745 list.
2746 @end defun
2747
2748 @defun remove-list-of-text-properties start end list-of-properties &optional object
2749 Like @code{remove-text-properties} except that
2750 @var{list-of-properties} is a list of property names only, not an
2751 alternating list of property names and values.
2752 @end defun
2753
2754 @defun set-text-properties start end props &optional object
2755 This function completely replaces the text property list for the text
2756 between @var{start} and @var{end} in the string or buffer @var{object}.
2757 If @var{object} is @code{nil}, it defaults to the current buffer.
2758
2759 The argument @var{props} is the new property list. It should be a list
2760 whose elements are property names alternating with corresponding values.
2761
2762 After @code{set-text-properties} returns, all the characters in the
2763 specified range have identical properties.
2764
2765 If @var{props} is @code{nil}, the effect is to get rid of all properties
2766 from the specified range of text. Here's an example:
2767
2768 @example
2769 (set-text-properties @var{start} @var{end} nil)
2770 @end example
2771
2772 Do not rely on the return value of this function.
2773 @end defun
2774
2775 The easiest way to make a string with text properties
2776 is with @code{propertize}:
2777
2778 @defun propertize string &rest properties
2779 This function returns a copy of @var{string} which has the text
2780 properties @var{properties}. These properties apply to all the
2781 characters in the string that is returned. Here is an example that
2782 constructs a string with a @code{face} property and a @code{mouse-face}
2783 property:
2784
2785 @smallexample
2786 (propertize "foo" 'face 'italic
2787 'mouse-face 'bold-italic)
2788 @result{} #("foo" 0 3 (mouse-face bold-italic face italic))
2789 @end smallexample
2790
2791 To put different properties on various parts of a string, you can
2792 construct each part with @code{propertize} and then combine them with
2793 @code{concat}:
2794
2795 @smallexample
2796 (concat
2797 (propertize "foo" 'face 'italic
2798 'mouse-face 'bold-italic)
2799 " and "
2800 (propertize "bar" 'face 'italic
2801 'mouse-face 'bold-italic))
2802 @result{} #("foo and bar"
2803 0 3 (face italic mouse-face bold-italic)
2804 3 8 nil
2805 8 11 (face italic mouse-face bold-italic))
2806 @end smallexample
2807 @end defun
2808
2809 See also the function @code{buffer-substring-no-properties}
2810 (@pxref{Buffer Contents}) which copies text from the buffer
2811 but does not copy its properties.
2812
2813 @node Property Search
2814 @subsection Text Property Search Functions
2815
2816 In typical use of text properties, most of the time several or many
2817 consecutive characters have the same value for a property. Rather than
2818 writing your programs to examine characters one by one, it is much
2819 faster to process chunks of text that have the same property value.
2820
2821 Here are functions you can use to do this. They use @code{eq} for
2822 comparing property values. In all cases, @var{object} defaults to the
2823 current buffer.
2824
2825 For high performance, it's very important to use the @var{limit}
2826 argument to these functions, especially the ones that search for a
2827 single property---otherwise, they may spend a long time scanning to the
2828 end of the buffer, if the property you are interested in does not change.
2829
2830 These functions do not move point; instead, they return a position (or
2831 @code{nil}). Remember that a position is always between two characters;
2832 the position returned by these functions is between two characters with
2833 different properties.
2834
2835 @defun next-property-change pos &optional object limit
2836 The function scans the text forward from position @var{pos} in the
2837 string or buffer @var{object} till it finds a change in some text
2838 property, then returns the position of the change. In other words, it
2839 returns the position of the first character beyond @var{pos} whose
2840 properties are not identical to those of the character just after
2841 @var{pos}.
2842
2843 If @var{limit} is non-@code{nil}, then the scan ends at position
2844 @var{limit}. If there is no property change before that point,
2845 @code{next-property-change} returns @var{limit}.
2846
2847 The value is @code{nil} if the properties remain unchanged all the way
2848 to the end of @var{object} and @var{limit} is @code{nil}. If the value
2849 is non-@code{nil}, it is a position greater than or equal to @var{pos}.
2850 The value equals @var{pos} only when @var{limit} equals @var{pos}.
2851
2852 Here is an example of how to scan the buffer by chunks of text within
2853 which all properties are constant:
2854
2855 @smallexample
2856 (while (not (eobp))
2857 (let ((plist (text-properties-at (point)))
2858 (next-change
2859 (or (next-property-change (point) (current-buffer))
2860 (point-max))))
2861 @r{Process text from point to @var{next-change}@dots{}}
2862 (goto-char next-change)))
2863 @end smallexample
2864 @end defun
2865
2866 @defun previous-property-change pos &optional object limit
2867 This is like @code{next-property-change}, but scans back from @var{pos}
2868 instead of forward. If the value is non-@code{nil}, it is a position
2869 less than or equal to @var{pos}; it equals @var{pos} only if @var{limit}
2870 equals @var{pos}.
2871 @end defun
2872
2873 @defun next-single-property-change pos prop &optional object limit
2874 The function scans text for a change in the @var{prop} property, then
2875 returns the position of the change. The scan goes forward from
2876 position @var{pos} in the string or buffer @var{object}. In other
2877 words, this function returns the position of the first character
2878 beyond @var{pos} whose @var{prop} property differs from that of the
2879 character just after @var{pos}.
2880
2881 If @var{limit} is non-@code{nil}, then the scan ends at position
2882 @var{limit}. If there is no property change before that point,
2883 @code{next-single-property-change} returns @var{limit}.
2884
2885 The value is @code{nil} if the property remains unchanged all the way to
2886 the end of @var{object} and @var{limit} is @code{nil}. If the value is
2887 non-@code{nil}, it is a position greater than or equal to @var{pos}; it
2888 equals @var{pos} only if @var{limit} equals @var{pos}.
2889 @end defun
2890
2891 @defun previous-single-property-change pos prop &optional object limit
2892 This is like @code{next-single-property-change}, but scans back from
2893 @var{pos} instead of forward. If the value is non-@code{nil}, it is a
2894 position less than or equal to @var{pos}; it equals @var{pos} only if
2895 @var{limit} equals @var{pos}.
2896 @end defun
2897
2898 @defun next-char-property-change pos &optional limit
2899 This is like @code{next-property-change} except that it considers
2900 overlay properties as well as text properties, and if no change is
2901 found before the end of the buffer, it returns the maximum buffer
2902 position rather than @code{nil} (in this sense, it resembles the
2903 corresponding overlay function @code{next-overlay-change}, rather than
2904 @code{next-property-change}). There is no @var{object} operand
2905 because this function operates only on the current buffer. It returns
2906 the next address at which either kind of property changes.
2907 @end defun
2908
2909 @defun previous-char-property-change pos &optional limit
2910 This is like @code{next-char-property-change}, but scans back from
2911 @var{pos} instead of forward, and returns the minimum buffer
2912 position if no change is found.
2913 @end defun
2914
2915 @defun next-single-char-property-change pos prop &optional object limit
2916 This is like @code{next-single-property-change} except that it
2917 considers overlay properties as well as text properties, and if no
2918 change is found before the end of the @var{object}, it returns the
2919 maximum valid position in @var{object} rather than @code{nil}. Unlike
2920 @code{next-char-property-change}, this function @emph{does} have an
2921 @var{object} operand; if @var{object} is not a buffer, only
2922 text-properties are considered.
2923 @end defun
2924
2925 @defun previous-single-char-property-change pos prop &optional object limit
2926 This is like @code{next-single-char-property-change}, but scans back
2927 from @var{pos} instead of forward, and returns the minimum valid
2928 position in @var{object} if no change is found.
2929 @end defun
2930
2931 @defun text-property-any start end prop value &optional object
2932 This function returns non-@code{nil} if at least one character between
2933 @var{start} and @var{end} has a property @var{prop} whose value is
2934 @var{value}. More precisely, it returns the position of the first such
2935 character. Otherwise, it returns @code{nil}.
2936
2937 The optional fifth argument, @var{object}, specifies the string or
2938 buffer to scan. Positions are relative to @var{object}. The default
2939 for @var{object} is the current buffer.
2940 @end defun
2941
2942 @defun text-property-not-all start end prop value &optional object
2943 This function returns non-@code{nil} if at least one character between
2944 @var{start} and @var{end} does not have a property @var{prop} with value
2945 @var{value}. More precisely, it returns the position of the first such
2946 character. Otherwise, it returns @code{nil}.
2947
2948 The optional fifth argument, @var{object}, specifies the string or
2949 buffer to scan. Positions are relative to @var{object}. The default
2950 for @var{object} is the current buffer.
2951 @end defun
2952
2953 @node Special Properties
2954 @subsection Properties with Special Meanings
2955
2956 Here is a table of text property names that have special built-in
2957 meanings. The following sections list a few additional special property
2958 names that control filling and property inheritance. All other names
2959 have no standard meaning, and you can use them as you like.
2960
2961 Note: the properties @code{composition}, @code{display},
2962 @code{invisible} and @code{intangible} can also cause point to move to
2963 an acceptable place, after each Emacs command. @xref{Adjusting
2964 Point}.
2965
2966 @table @code
2967 @cindex property category of text character
2968 @kindex category @r{(text property)}
2969 @item category
2970 If a character has a @code{category} property, we call it the
2971 @dfn{property category} of the character. It should be a symbol. The
2972 properties of this symbol serve as defaults for the properties of the
2973 character.
2974
2975 @item face
2976 @cindex face codes of text
2977 @kindex face @r{(text property)}
2978 You can use the property @code{face} to control the font and color of
2979 text. @xref{Faces}, for more information.
2980
2981 In the simplest case, the value is a face name. It can also be a list;
2982 then each element can be any of these possibilities;
2983
2984 @itemize @bullet
2985 @item
2986 A face name (a symbol or string).
2987
2988 @item
2989 A property list of face attributes. This has the
2990 form (@var{keyword} @var{value} @dots{}), where each @var{keyword} is a
2991 face attribute name and @var{value} is a meaningful value for that
2992 attribute. With this feature, you do not need to create a face each
2993 time you want to specify a particular attribute for certain text.
2994 @xref{Face Attributes}.
2995
2996 @item
2997 A cons cell with the form @code{(foreground-color . @var{color-name})}
2998 or @code{(background-color . @var{color-name})}. These are old,
2999 deprecated equivalents for @code{(:foreground @var{color-name})} and
3000 @code{(:background @var{color-name})}. Please convert code that uses
3001 them.
3002 @end itemize
3003
3004 It works to use the latter two forms directly as the value
3005 of the @code{face} property.
3006
3007 Font Lock mode (@pxref{Font Lock Mode}) works in most buffers by
3008 dynamically updating the @code{face} property of characters based on
3009 the context.
3010
3011 @item font-lock-face
3012 @kindex font-lock-face @r{(text property)}
3013 The @code{font-lock-face} property is equivalent to the @code{face}
3014 property when Font Lock mode is enabled. When Font Lock mode is disabled,
3015 @code{font-lock-face} has no effect.
3016
3017 The @code{font-lock-face} property is useful for special modes that
3018 implement their own highlighting. @xref{Precalculated Fontification}.
3019
3020 @item mouse-face
3021 @kindex mouse-face @r{(text property)}
3022 The property @code{mouse-face} is used instead of @code{face} when the
3023 mouse is on or near the character. For this purpose, ``near'' means
3024 that all text between the character and where the mouse is have the same
3025 @code{mouse-face} property value.
3026
3027 @item fontified
3028 @kindex fontified @r{(text property)}
3029 This property says whether the text is ready for display. If
3030 @code{nil}, Emacs's redisplay routine calls the functions in
3031 @code{fontification-functions} (@pxref{Auto Faces}) to prepare this
3032 part of the buffer before it is displayed. It is used internally by
3033 the ``just in time'' font locking code.
3034
3035 @item display
3036 This property activates various features that change the
3037 way text is displayed. For example, it can make text appear taller
3038 or shorter, higher or lower, wider or narrow, or replaced with an image.
3039 @xref{Display Property}.
3040
3041 @item help-echo
3042 @kindex help-echo @r{(text property)}
3043 @cindex tooltip
3044 @anchor{Text help-echo}
3045 If text has a string as its @code{help-echo} property, then when you
3046 move the mouse onto that text, Emacs displays that string in the echo
3047 area, or in the tooltip window (@pxref{Tooltips,,, emacs, The GNU Emacs
3048 Manual}).
3049
3050 If the value of the @code{help-echo} property is a function, that
3051 function is called with three arguments, @var{window}, @var{object} and
3052 @var{pos} and should return a help string or @code{nil} for
3053 none. The first argument, @var{window} is the window in which
3054 the help was found. The second, @var{object}, is the buffer, overlay or
3055 string which had the @code{help-echo} property. The @var{pos}
3056 argument is as follows:
3057
3058 @itemize @bullet{}
3059 @item
3060 If @var{object} is a buffer, @var{pos} is the position in the buffer.
3061 @item
3062 If @var{object} is an overlay, that overlay has a @code{help-echo}
3063 property, and @var{pos} is the position in the overlay's buffer.
3064 @item
3065 If @var{object} is a string (an overlay string or a string displayed
3066 with the @code{display} property), @var{pos} is the position in that
3067 string.
3068 @end itemize
3069
3070 If the value of the @code{help-echo} property is neither a function nor
3071 a string, it is evaluated to obtain a help string.
3072
3073 You can alter the way help text is displayed by setting the variable
3074 @code{show-help-function} (@pxref{Help display}).
3075
3076 This feature is used in the mode line and for other active text.
3077
3078 @item keymap
3079 @cindex keymap of character
3080 @kindex keymap @r{(text property)}
3081 The @code{keymap} property specifies an additional keymap for
3082 commands. When this keymap applies, it is used for key lookup before
3083 the minor mode keymaps and before the buffer's local map.
3084 @xref{Active Keymaps}. If the property value is a symbol, the
3085 symbol's function definition is used as the keymap.
3086
3087 The property's value for the character before point applies if it is
3088 non-@code{nil} and rear-sticky, and the property's value for the
3089 character after point applies if it is non-@code{nil} and
3090 front-sticky. (For mouse clicks, the position of the click is used
3091 instead of the position of point.)
3092
3093 @item local-map
3094 @kindex local-map @r{(text property)}
3095 This property works like @code{keymap} except that it specifies a
3096 keymap to use @emph{instead of} the buffer's local map. For most
3097 purposes (perhaps all purposes), it is better to use the @code{keymap}
3098 property.
3099
3100 @item syntax-table
3101 The @code{syntax-table} property overrides what the syntax table says
3102 about this particular character. @xref{Syntax Properties}.
3103
3104 @item read-only
3105 @cindex read-only character
3106 @kindex read-only @r{(text property)}
3107 If a character has the property @code{read-only}, then modifying that
3108 character is not allowed. Any command that would do so gets an error,
3109 @code{text-read-only}. If the property value is a string, that string
3110 is used as the error message.
3111
3112 Insertion next to a read-only character is an error if inserting
3113 ordinary text there would inherit the @code{read-only} property due to
3114 stickiness. Thus, you can control permission to insert next to
3115 read-only text by controlling the stickiness. @xref{Sticky Properties}.
3116
3117 Since changing properties counts as modifying the buffer, it is not
3118 possible to remove a @code{read-only} property unless you know the
3119 special trick: bind @code{inhibit-read-only} to a non-@code{nil} value
3120 and then remove the property. @xref{Read Only Buffers}.
3121
3122 @item invisible
3123 @kindex invisible @r{(text property)}
3124 A non-@code{nil} @code{invisible} property can make a character invisible
3125 on the screen. @xref{Invisible Text}, for details.
3126
3127 @item intangible
3128 @kindex intangible @r{(text property)}
3129 If a group of consecutive characters have equal and non-@code{nil}
3130 @code{intangible} properties, then you cannot place point between them.
3131 If you try to move point forward into the group, point actually moves to
3132 the end of the group. If you try to move point backward into the group,
3133 point actually moves to the start of the group.
3134
3135 If consecutive characters have unequal non-@code{nil}
3136 @code{intangible} properties, they belong to separate groups; each
3137 group is separately treated as described above.
3138
3139 When the variable @code{inhibit-point-motion-hooks} is non-@code{nil},
3140 the @code{intangible} property is ignored.
3141
3142 @item field
3143 @kindex field @r{(text property)}
3144 Consecutive characters with the same @code{field} property constitute a
3145 @dfn{field}. Some motion functions including @code{forward-word} and
3146 @code{beginning-of-line} stop moving at a field boundary.
3147 @xref{Fields}.
3148
3149 @item cursor
3150 @kindex cursor @r{(text property)}
3151 Normally, the cursor is displayed at the end of any overlay and text
3152 property strings present at the current buffer position. You can
3153 place the cursor on any desired character of these strings by giving
3154 that character a non-@code{nil} @code{cursor} text property. In
3155 addition, if the value of the @code{cursor} property of an overlay
3156 string is an integer number, it specifies the number of buffer's
3157 character positions associated with the overlay string; this way,
3158 Emacs will display the cursor on the character with that property
3159 regardless of whether the current buffer position is actually covered
3160 by the overlay. Specifically, if the value of the @code{cursor}
3161 property of a character is the number @var{n}, the cursor will be
3162 displayed on this character for any buffer position in the range
3163 @code{[@var{ovpos}..@var{ovpos}+@var{n}]}, where @var{ovpos} is the
3164 starting buffer position covered by the overlay (@pxref{Managing
3165 Overlays}).
3166
3167 @item pointer
3168 @kindex pointer @r{(text property)}
3169 This specifies a specific pointer shape when the mouse pointer is over
3170 this text or image. @xref{Pointer Shape}, for possible pointer
3171 shapes.
3172
3173 @item line-spacing
3174 @kindex line-spacing @r{(text property)}
3175 A newline can have a @code{line-spacing} text or overlay property that
3176 controls the height of the display line ending with that newline. The
3177 property value overrides the default frame line spacing and the buffer
3178 local @code{line-spacing} variable. @xref{Line Height}.
3179
3180 @item line-height
3181 @kindex line-height @r{(text property)}
3182 A newline can have a @code{line-height} text or overlay property that
3183 controls the total height of the display line ending in that newline.
3184 @xref{Line Height}.
3185
3186 @item wrap-prefix
3187 If text has a @code{wrap-prefix} property, the prefix it defines will
3188 be added at display-time to the beginning of every continuation line
3189 due to text wrapping (so if lines are truncated, the wrap-prefix is
3190 never used). It may be a string, an image, or a stretch-glyph such as
3191 used by the @code{display} text-property. @xref{Display Property}.
3192
3193 A wrap-prefix may also be specified for an entire buffer using the
3194 @code{wrap-prefix} buffer-local variable (however, a
3195 @code{wrap-prefix} text-property takes precedence over the value of
3196 the @code{wrap-prefix} variable). @xref{Truncation}.
3197
3198 @item line-prefix
3199 If text has a @code{line-prefix} property, the prefix it defines will
3200 be added at display-time to the beginning of every non-continuation
3201 line. It may be a string, an image, or a stretch-glyph such as used
3202 by the @code{display} text-property. @xref{Display Property}.
3203
3204 A line-prefix may also be specified for an entire buffer using the
3205 @code{line-prefix} buffer-local variable (however, a
3206 @code{line-prefix} text-property takes precedence over the value of
3207 the @code{line-prefix} variable). @xref{Truncation}.
3208
3209 @item modification-hooks
3210 @cindex change hooks for a character
3211 @cindex hooks for changing a character
3212 @kindex modification-hooks @r{(text property)}
3213 If a character has the property @code{modification-hooks}, then its
3214 value should be a list of functions; modifying that character calls
3215 all of those functions before the actual modification. Each function
3216 receives two arguments: the beginning and end of the part of the
3217 buffer being modified. Note that if a particular modification hook
3218 function appears on several characters being modified by a single
3219 primitive, you can't predict how many times the function will
3220 be called.
3221 Furthermore, insertion will not modify any existing character, so this
3222 hook will only be run when removing some characters, replacing them
3223 with others, or changing their text-properties.
3224
3225 If these functions modify the buffer, they should bind
3226 @code{inhibit-modification-hooks} to @code{t} around doing so, to
3227 avoid confusing the internal mechanism that calls these hooks.
3228
3229 Overlays also support the @code{modification-hooks} property, but the
3230 details are somewhat different (@pxref{Overlay Properties}).
3231
3232 @item insert-in-front-hooks
3233 @itemx insert-behind-hooks
3234 @kindex insert-in-front-hooks @r{(text property)}
3235 @kindex insert-behind-hooks @r{(text property)}
3236 The operation of inserting text in a buffer also calls the functions
3237 listed in the @code{insert-in-front-hooks} property of the following
3238 character and in the @code{insert-behind-hooks} property of the
3239 preceding character. These functions receive two arguments, the
3240 beginning and end of the inserted text. The functions are called
3241 @emph{after} the actual insertion takes place.
3242
3243 See also @ref{Change Hooks}, for other hooks that are called
3244 when you change text in a buffer.
3245
3246 @item point-entered
3247 @itemx point-left
3248 @cindex hooks for motion of point
3249 @kindex point-entered @r{(text property)}
3250 @kindex point-left @r{(text property)}
3251 The special properties @code{point-entered} and @code{point-left}
3252 record hook functions that report motion of point. Each time point
3253 moves, Emacs compares these two property values:
3254
3255 @itemize @bullet
3256 @item
3257 the @code{point-left} property of the character after the old location,
3258 and
3259 @item
3260 the @code{point-entered} property of the character after the new
3261 location.
3262 @end itemize
3263
3264 @noindent
3265 If these two values differ, each of them is called (if not @code{nil})
3266 with two arguments: the old value of point, and the new one.
3267
3268 The same comparison is made for the characters before the old and new
3269 locations. The result may be to execute two @code{point-left} functions
3270 (which may be the same function) and/or two @code{point-entered}
3271 functions (which may be the same function). In any case, all the
3272 @code{point-left} functions are called first, followed by all the
3273 @code{point-entered} functions.
3274
3275 It is possible with @code{char-after} to examine characters at various
3276 buffer positions without moving point to those positions. Only an
3277 actual change in the value of point runs these hook functions.
3278
3279 @defvar inhibit-point-motion-hooks
3280 When this variable is non-@code{nil}, @code{point-left} and
3281 @code{point-entered} hooks are not run, and the @code{intangible}
3282 property has no effect. Do not set this variable globally; bind it with
3283 @code{let}.
3284 @end defvar
3285
3286 @defvar show-help-function
3287 @anchor{Help display} If this variable is non-@code{nil}, it specifies a
3288 function called to display help strings. These may be @code{help-echo}
3289 properties, menu help strings (@pxref{Simple Menu Items},
3290 @pxref{Extended Menu Items}), or tool bar help strings (@pxref{Tool
3291 Bar}). The specified function is called with one argument, the help
3292 string to display. Tooltip mode (@pxref{Tooltips,,, emacs, The GNU Emacs
3293 Manual}) provides an example.
3294 @end defvar
3295
3296 @item composition
3297 @kindex composition @r{(text property)}
3298 This text property is used to display a sequence of characters as a
3299 single glyph composed from components. But the value of the property
3300 itself is completely internal to Emacs and should not be manipulated
3301 directly by, for instance, @code{put-text-property}.
3302
3303 @end table
3304
3305 @node Format Properties
3306 @subsection Formatted Text Properties
3307
3308 These text properties affect the behavior of the fill commands. They
3309 are used for representing formatted text. @xref{Filling}, and
3310 @ref{Margins}.
3311
3312 @table @code
3313 @item hard
3314 If a newline character has this property, it is a ``hard'' newline.
3315 The fill commands do not alter hard newlines and do not move words
3316 across them. However, this property takes effect only if the
3317 @code{use-hard-newlines} minor mode is enabled. @xref{Hard and Soft
3318 Newlines,, Hard and Soft Newlines, emacs, The GNU Emacs Manual}.
3319
3320 @item right-margin
3321 This property specifies an extra right margin for filling this part of the
3322 text.
3323
3324 @item left-margin
3325 This property specifies an extra left margin for filling this part of the
3326 text.
3327
3328 @item justification
3329 This property specifies the style of justification for filling this part
3330 of the text.
3331 @end table
3332
3333 @node Sticky Properties
3334 @subsection Stickiness of Text Properties
3335 @cindex sticky text properties
3336 @cindex inheritance of text properties
3337
3338 Self-inserting characters normally take on the same properties as the
3339 preceding character. This is called @dfn{inheritance} of properties.
3340
3341 In a Lisp program, you can do insertion with inheritance or without,
3342 depending on your choice of insertion primitive. The ordinary text
3343 insertion functions such as @code{insert} do not inherit any properties.
3344 They insert text with precisely the properties of the string being
3345 inserted, and no others. This is correct for programs that copy text
3346 from one context to another---for example, into or out of the kill ring.
3347 To insert with inheritance, use the special primitives described in this
3348 section. Self-inserting characters inherit properties because they work
3349 using these primitives.
3350
3351 When you do insertion with inheritance, @emph{which} properties are
3352 inherited, and from where, depends on which properties are @dfn{sticky}.
3353 Insertion after a character inherits those of its properties that are
3354 @dfn{rear-sticky}. Insertion before a character inherits those of its
3355 properties that are @dfn{front-sticky}. When both sides offer different
3356 sticky values for the same property, the previous character's value
3357 takes precedence.
3358
3359 By default, a text property is rear-sticky but not front-sticky; thus,
3360 the default is to inherit all the properties of the preceding character,
3361 and nothing from the following character.
3362
3363 You can control the stickiness of various text properties with two
3364 specific text properties, @code{front-sticky} and @code{rear-nonsticky},
3365 and with the variable @code{text-property-default-nonsticky}. You can
3366 use the variable to specify a different default for a given property.
3367 You can use those two text properties to make any specific properties
3368 sticky or nonsticky in any particular part of the text.
3369
3370 If a character's @code{front-sticky} property is @code{t}, then all
3371 its properties are front-sticky. If the @code{front-sticky} property is
3372 a list, then the sticky properties of the character are those whose
3373 names are in the list. For example, if a character has a
3374 @code{front-sticky} property whose value is @code{(face read-only)},
3375 then insertion before the character can inherit its @code{face} property
3376 and its @code{read-only} property, but no others.
3377
3378 The @code{rear-nonsticky} property works the opposite way. Most
3379 properties are rear-sticky by default, so the @code{rear-nonsticky}
3380 property says which properties are @emph{not} rear-sticky. If a
3381 character's @code{rear-nonsticky} property is @code{t}, then none of its
3382 properties are rear-sticky. If the @code{rear-nonsticky} property is a
3383 list, properties are rear-sticky @emph{unless} their names are in the
3384 list.
3385
3386 @defvar text-property-default-nonsticky
3387 This variable holds an alist which defines the default rear-stickiness
3388 of various text properties. Each element has the form
3389 @code{(@var{property} . @var{nonstickiness})}, and it defines the
3390 stickiness of a particular text property, @var{property}.
3391
3392 If @var{nonstickiness} is non-@code{nil}, this means that the property
3393 @var{property} is rear-nonsticky by default. Since all properties are
3394 front-nonsticky by default, this makes @var{property} nonsticky in both
3395 directions by default.
3396
3397 The text properties @code{front-sticky} and @code{rear-nonsticky}, when
3398 used, take precedence over the default @var{nonstickiness} specified in
3399 @code{text-property-default-nonsticky}.
3400 @end defvar
3401
3402 Here are the functions that insert text with inheritance of properties:
3403
3404 @defun insert-and-inherit &rest strings
3405 Insert the strings @var{strings}, just like the function @code{insert},
3406 but inherit any sticky properties from the adjoining text.
3407 @end defun
3408
3409 @defun insert-before-markers-and-inherit &rest strings
3410 Insert the strings @var{strings}, just like the function
3411 @code{insert-before-markers}, but inherit any sticky properties from the
3412 adjoining text.
3413 @end defun
3414
3415 @xref{Insertion}, for the ordinary insertion functions which do not
3416 inherit.
3417
3418 @node Lazy Properties
3419 @subsection Lazy Computation of Text Properties
3420
3421 Instead of computing text properties for all the text in the buffer,
3422 you can arrange to compute the text properties for parts of the text
3423 when and if something depends on them.
3424
3425 The primitive that extracts text from the buffer along with its
3426 properties is @code{buffer-substring}. Before examining the properties,
3427 this function runs the abnormal hook @code{buffer-access-fontify-functions}.
3428
3429 @defvar buffer-access-fontify-functions
3430 This variable holds a list of functions for computing text properties.
3431 Before @code{buffer-substring} copies the text and text properties for a
3432 portion of the buffer, it calls all the functions in this list. Each of
3433 the functions receives two arguments that specify the range of the
3434 buffer being accessed. (The buffer itself is always the current
3435 buffer.)
3436 @end defvar
3437
3438 The function @code{buffer-substring-no-properties} does not call these
3439 functions, since it ignores text properties anyway.
3440
3441 In order to prevent the hook functions from being called more than
3442 once for the same part of the buffer, you can use the variable
3443 @code{buffer-access-fontified-property}.
3444
3445 @defvar buffer-access-fontified-property
3446 If this variable's value is non-@code{nil}, it is a symbol which is used
3447 as a text property name. A non-@code{nil} value for that text property
3448 means, ``the other text properties for this character have already been
3449 computed.''
3450
3451 If all the characters in the range specified for @code{buffer-substring}
3452 have a non-@code{nil} value for this property, @code{buffer-substring}
3453 does not call the @code{buffer-access-fontify-functions} functions. It
3454 assumes these characters already have the right text properties, and
3455 just copies the properties they already have.
3456
3457 The normal way to use this feature is that the
3458 @code{buffer-access-fontify-functions} functions add this property, as
3459 well as others, to the characters they operate on. That way, they avoid
3460 being called over and over for the same text.
3461 @end defvar
3462
3463 @node Clickable Text
3464 @subsection Defining Clickable Text
3465 @cindex clickable text
3466 @cindex follow links
3467 @cindex mouse-1
3468
3469 @dfn{Clickable text} is text that can be clicked, with either the
3470 mouse or via a keyboard command, to produce some result. Many major
3471 modes use clickable text to implement textual hyper-links, or
3472 @dfn{links} for short.
3473
3474 The easiest way to insert and manipulate links is to use the
3475 @code{button} package. @xref{Buttons}. In this section, we will
3476 explain how to manually set up clickable text in a buffer, using text
3477 properties. For simplicity, we will refer to the clickable text as a
3478 @dfn{link}.
3479
3480 Implementing a link involves three separate steps: (1) indicating
3481 clickability when the mouse moves over the link; (2) making @kbd{RET}
3482 or @kbd{Mouse-2} on that link do something; and (3) setting up a
3483 @code{follow-link} condition so that the link obeys
3484 @code{mouse-1-click-follows-link}.
3485
3486 To indicate clickability, add the @code{mouse-face} text property to
3487 the text of the link; then Emacs will highlight the link when the
3488 mouse moves over it. In addition, you should define a tooltip or echo
3489 area message, using the @code{help-echo} text property. @xref{Special
3490 Properties}. For instance, here is how Dired indicates that file
3491 names are clickable:
3492
3493 @smallexample
3494 (if (dired-move-to-filename)
3495 (add-text-properties
3496 (point)
3497 (save-excursion
3498 (dired-move-to-end-of-filename)
3499 (point))
3500 '(mouse-face highlight
3501 help-echo "mouse-2: visit this file in other window")))
3502 @end smallexample
3503
3504 To make the link clickable, bind @key{RET} and @kbd{Mouse-2} to
3505 commands that perform the desired action. Each command should check
3506 to see whether it was called on a link, and act accordingly. For
3507 instance, Dired's major mode keymap binds @kbd{Mouse-2} to the
3508 following command:
3509
3510 @smallexample
3511 (defun dired-mouse-find-file-other-window (event)
3512 "In Dired, visit the file or directory name you click on."
3513 (interactive "e")
3514 (let ((window (posn-window (event-end event)))
3515 (pos (posn-point (event-end event)))
3516 file)
3517 (if (not (windowp window))
3518 (error "No file chosen"))
3519 (with-current-buffer (window-buffer window)
3520 (goto-char pos)
3521 (setq file (dired-get-file-for-visit)))
3522 (if (file-directory-p file)
3523 (or (and (cdr dired-subdir-alist)
3524 (dired-goto-subdir file))
3525 (progn
3526 (select-window window)
3527 (dired-other-window file)))
3528 (select-window window)
3529 (find-file-other-window (file-name-sans-versions file t)))))
3530 @end smallexample
3531
3532 @noindent
3533 This command uses the functions @code{posn-window} and
3534 @code{posn-point} to determine where the click occurred, and
3535 @code{dired-get-file-for-visit} to determine which file to visit.
3536
3537 Instead of binding the mouse command in a major mode keymap, you can
3538 bind it within the link text, using the @code{keymap} text property
3539 (@pxref{Special Properties}). For instance:
3540
3541 @example
3542 (let ((map (make-sparse-keymap)))
3543 (define-key map [mouse-2] 'operate-this-button)
3544 (put-text-property link-start link-end 'keymap map))
3545 @end example
3546
3547 @noindent
3548 With this method, you can easily define different commands for
3549 different links. Furthermore, the global definition of @key{RET} and
3550 @kbd{Mouse-2} remain available for the rest of the text in the buffer.
3551
3552 @vindex mouse-1-click-follows-link
3553 The basic Emacs command for clicking on links is @kbd{Mouse-2}.
3554 However, for compatibility with other graphical applications, Emacs
3555 also recognizes @kbd{Mouse-1} clicks on links, provided the user
3556 clicks on the link quickly without moving the mouse. This behavior is
3557 controlled by the user option @code{mouse-1-click-follows-link}.
3558 @xref{Mouse References,,, emacs, The GNU Emacs Manual}.
3559
3560 To set up the link so that it obeys
3561 @code{mouse-1-click-follows-link}, you must either (1) apply a
3562 @code{follow-link} text or overlay property to the link text, or (2)
3563 bind the @code{follow-link} event to a keymap (which can be a major
3564 mode keymap or a local keymap specified via the @code{keymap} text
3565 property). The value of the @code{follow-link} property, or the
3566 binding for the @code{follow-link} event, acts as a ``condition'' for
3567 the link action. This condition tells Emacs two things: the
3568 circumstances under which a @kbd{Mouse-1} click should be regarded as
3569 occurring ``inside'' the link, and how to compute an ``action code''
3570 that says what to translate the @kbd{Mouse-1} click into. The link
3571 action condition can be one of the following:
3572
3573 @table @asis
3574 @item @code{mouse-face}
3575 If the condition is the symbol @code{mouse-face}, a position is inside
3576 a link if there is a non-@code{nil} @code{mouse-face} property at that
3577 position. The action code is always @code{t}.
3578
3579 For example, here is how Info mode handles @key{Mouse-1}:
3580
3581 @smallexample
3582 (define-key Info-mode-map [follow-link] 'mouse-face)
3583 @end smallexample
3584
3585 @item a function
3586 If the condition is a function, @var{func}, then a position @var{pos}
3587 is inside a link if @code{(@var{func} @var{pos})} evaluates to
3588 non-@code{nil}. The value returned by @var{func} serves as the action
3589 code.
3590
3591 For example, here is how pcvs enables @kbd{Mouse-1} to follow links on
3592 file names only:
3593
3594 @smallexample
3595 (define-key map [follow-link]
3596 (lambda (pos)
3597 (eq (get-char-property pos 'face) 'cvs-filename-face)))
3598 @end smallexample
3599
3600 @item anything else
3601 If the condition value is anything else, then the position is inside a
3602 link and the condition itself is the action code. Clearly, you should
3603 specify this kind of condition only when applying the condition via a
3604 text or property overlay on the link text (so that it does not apply
3605 to the entire buffer).
3606 @end table
3607
3608 @noindent
3609 The action code tells @kbd{Mouse-1} how to follow the link:
3610
3611 @table @asis
3612 @item a string or vector
3613 If the action code is a string or vector, the @kbd{Mouse-1} event is
3614 translated into the first element of the string or vector; i.e., the
3615 action of the @kbd{Mouse-1} click is the local or global binding of
3616 that character or symbol. Thus, if the action code is @code{"foo"},
3617 @kbd{Mouse-1} translates into @kbd{f}. If it is @code{[foo]},
3618 @kbd{Mouse-1} translates into @key{foo}.
3619
3620 @item anything else
3621 For any other non-@code{nil} action code, the @kbd{Mouse-1} event is
3622 translated into a @kbd{Mouse-2} event at the same position.
3623 @end table
3624
3625 To define @kbd{Mouse-1} to activate a button defined with
3626 @code{define-button-type}, give the button a @code{follow-link}
3627 property. The property value should be a link action condition, as
3628 described above. @xref{Buttons}. For example, here is how Help mode
3629 handles @kbd{Mouse-1}:
3630
3631 @smallexample
3632 (define-button-type 'help-xref
3633 'follow-link t
3634 'action #'help-button-action)
3635 @end smallexample
3636
3637 To define @kbd{Mouse-1} on a widget defined with
3638 @code{define-widget}, give the widget a @code{:follow-link} property.
3639 The property value should be a link action condition, as described
3640 above. For example, here is how the @code{link} widget specifies that
3641 a @key{Mouse-1} click shall be translated to @key{RET}:
3642
3643 @smallexample
3644 (define-widget 'link 'item
3645 "An embedded link."
3646 :button-prefix 'widget-link-prefix
3647 :button-suffix 'widget-link-suffix
3648 :follow-link "\C-m"
3649 :help-echo "Follow the link."
3650 :format "%[%t%]")
3651 @end smallexample
3652
3653 @defun mouse-on-link-p pos
3654 This function returns non-@code{nil} if position @var{pos} in the
3655 current buffer is on a link. @var{pos} can also be a mouse event
3656 location, as returned by @code{event-start} (@pxref{Accessing Mouse}).
3657 @end defun
3658
3659 @node Fields
3660 @subsection Defining and Using Fields
3661 @cindex fields
3662
3663 A field is a range of consecutive characters in the buffer that are
3664 identified by having the same value (comparing with @code{eq}) of the
3665 @code{field} property (either a text-property or an overlay property).
3666 This section describes special functions that are available for
3667 operating on fields.
3668
3669 You specify a field with a buffer position, @var{pos}. We think of
3670 each field as containing a range of buffer positions, so the position
3671 you specify stands for the field containing that position.
3672
3673 When the characters before and after @var{pos} are part of the same
3674 field, there is no doubt which field contains @var{pos}: the one those
3675 characters both belong to. When @var{pos} is at a boundary between
3676 fields, which field it belongs to depends on the stickiness of the
3677 @code{field} properties of the two surrounding characters (@pxref{Sticky
3678 Properties}). The field whose property would be inherited by text
3679 inserted at @var{pos} is the field that contains @var{pos}.
3680
3681 There is an anomalous case where newly inserted text at @var{pos}
3682 would not inherit the @code{field} property from either side. This
3683 happens if the previous character's @code{field} property is not
3684 rear-sticky, and the following character's @code{field} property is not
3685 front-sticky. In this case, @var{pos} belongs to neither the preceding
3686 field nor the following field; the field functions treat it as belonging
3687 to an empty field whose beginning and end are both at @var{pos}.
3688
3689 In all of these functions, if @var{pos} is omitted or @code{nil}, the
3690 value of point is used by default. If narrowing is in effect, then
3691 @var{pos} should fall within the accessible portion. @xref{Narrowing}.
3692
3693 @defun field-beginning &optional pos escape-from-edge limit
3694 This function returns the beginning of the field specified by @var{pos}.
3695
3696 If @var{pos} is at the beginning of its field, and
3697 @var{escape-from-edge} is non-@code{nil}, then the return value is
3698 always the beginning of the preceding field that @emph{ends} at @var{pos},
3699 regardless of the stickiness of the @code{field} properties around
3700 @var{pos}.
3701
3702 If @var{limit} is non-@code{nil}, it is a buffer position; if the
3703 beginning of the field is before @var{limit}, then @var{limit} will be
3704 returned instead.
3705 @end defun
3706
3707 @defun field-end &optional pos escape-from-edge limit
3708 This function returns the end of the field specified by @var{pos}.
3709
3710 If @var{pos} is at the end of its field, and @var{escape-from-edge} is
3711 non-@code{nil}, then the return value is always the end of the following
3712 field that @emph{begins} at @var{pos}, regardless of the stickiness of
3713 the @code{field} properties around @var{pos}.
3714
3715 If @var{limit} is non-@code{nil}, it is a buffer position; if the end
3716 of the field is after @var{limit}, then @var{limit} will be returned
3717 instead.
3718 @end defun
3719
3720 @defun field-string &optional pos
3721 This function returns the contents of the field specified by @var{pos},
3722 as a string.
3723 @end defun
3724
3725 @defun field-string-no-properties &optional pos
3726 This function returns the contents of the field specified by @var{pos},
3727 as a string, discarding text properties.
3728 @end defun
3729
3730 @defun delete-field &optional pos
3731 This function deletes the text of the field specified by @var{pos}.
3732 @end defun
3733
3734 @defun constrain-to-field new-pos old-pos &optional escape-from-edge only-in-line inhibit-capture-property
3735 This function ``constrains'' @var{new-pos} to the field that
3736 @var{old-pos} belongs to---in other words, it returns the position
3737 closest to @var{new-pos} that is in the same field as @var{old-pos}.
3738
3739 If @var{new-pos} is @code{nil}, then @code{constrain-to-field} uses
3740 the value of point instead, and moves point to the resulting position
3741 as well as returning it.
3742
3743 If @var{old-pos} is at the boundary of two fields, then the acceptable
3744 final positions depend on the argument @var{escape-from-edge}. If
3745 @var{escape-from-edge} is @code{nil}, then @var{new-pos} must be in
3746 the field whose @code{field} property equals what new characters
3747 inserted at @var{old-pos} would inherit. (This depends on the
3748 stickiness of the @code{field} property for the characters before and
3749 after @var{old-pos}.) If @var{escape-from-edge} is non-@code{nil},
3750 @var{new-pos} can be anywhere in the two adjacent fields.
3751 Additionally, if two fields are separated by another field with the
3752 special value @code{boundary}, then any point within this special
3753 field is also considered to be ``on the boundary.''
3754
3755 Commands like @kbd{C-a} with no argumemt, that normally move backward
3756 to a specific kind of location and stay there once there, probably
3757 should specify @code{nil} for @var{escape-from-edge}. Other motion
3758 commands that check fields should probably pass @code{t}.
3759
3760 If the optional argument @var{only-in-line} is non-@code{nil}, and
3761 constraining @var{new-pos} in the usual way would move it to a different
3762 line, @var{new-pos} is returned unconstrained. This used in commands
3763 that move by line, such as @code{next-line} and
3764 @code{beginning-of-line}, so that they respect field boundaries only in
3765 the case where they can still move to the right line.
3766
3767 If the optional argument @var{inhibit-capture-property} is
3768 non-@code{nil}, and @var{old-pos} has a non-@code{nil} property of that
3769 name, then any field boundaries are ignored.
3770
3771 You can cause @code{constrain-to-field} to ignore all field boundaries
3772 (and so never constrain anything) by binding the variable
3773 @code{inhibit-field-text-motion} to a non-@code{nil} value.
3774 @end defun
3775
3776 @node Not Intervals
3777 @subsection Why Text Properties are not Intervals
3778 @cindex intervals
3779
3780 Some editors that support adding attributes to text in the buffer do
3781 so by letting the user specify ``intervals'' within the text, and adding
3782 the properties to the intervals. Those editors permit the user or the
3783 programmer to determine where individual intervals start and end. We
3784 deliberately provided a different sort of interface in Emacs Lisp to
3785 avoid certain paradoxical behavior associated with text modification.
3786
3787 If the actual subdivision into intervals is meaningful, that means you
3788 can distinguish between a buffer that is just one interval with a
3789 certain property, and a buffer containing the same text subdivided into
3790 two intervals, both of which have that property.
3791
3792 Suppose you take the buffer with just one interval and kill part of
3793 the text. The text remaining in the buffer is one interval, and the
3794 copy in the kill ring (and the undo list) becomes a separate interval.
3795 Then if you yank back the killed text, you get two intervals with the
3796 same properties. Thus, editing does not preserve the distinction
3797 between one interval and two.
3798
3799 Suppose we ``fix'' this problem by coalescing the two intervals when
3800 the text is inserted. That works fine if the buffer originally was a
3801 single interval. But suppose instead that we have two adjacent
3802 intervals with the same properties, and we kill the text of one interval
3803 and yank it back. The same interval-coalescence feature that rescues
3804 the other case causes trouble in this one: after yanking, we have just
3805 one interval. One again, editing does not preserve the distinction
3806 between one interval and two.
3807
3808 Insertion of text at the border between intervals also raises
3809 questions that have no satisfactory answer.
3810
3811 However, it is easy to arrange for editing to behave consistently for
3812 questions of the form, ``What are the properties of this character?''
3813 So we have decided these are the only questions that make sense; we have
3814 not implemented asking questions about where intervals start or end.
3815
3816 In practice, you can usually use the text property search functions in
3817 place of explicit interval boundaries. You can think of them as finding
3818 the boundaries of intervals, assuming that intervals are always
3819 coalesced whenever possible. @xref{Property Search}.
3820
3821 Emacs also provides explicit intervals as a presentation feature; see
3822 @ref{Overlays}.
3823
3824 @node Substitution
3825 @section Substituting for a Character Code
3826
3827 The following functions replace characters within a specified region
3828 based on their character codes.
3829
3830 @defun subst-char-in-region start end old-char new-char &optional noundo
3831 @cindex replace characters
3832 This function replaces all occurrences of the character @var{old-char}
3833 with the character @var{new-char} in the region of the current buffer
3834 defined by @var{start} and @var{end}.
3835
3836 @cindex undo avoidance
3837 If @var{noundo} is non-@code{nil}, then @code{subst-char-in-region} does
3838 not record the change for undo and does not mark the buffer as modified.
3839 This was useful for controlling the old selective display feature
3840 (@pxref{Selective Display}).
3841
3842 @code{subst-char-in-region} does not move point and returns
3843 @code{nil}.
3844
3845 @example
3846 @group
3847 ---------- Buffer: foo ----------
3848 This is the contents of the buffer before.
3849 ---------- Buffer: foo ----------
3850 @end group
3851
3852 @group
3853 (subst-char-in-region 1 20 ?i ?X)
3854 @result{} nil
3855
3856 ---------- Buffer: foo ----------
3857 ThXs Xs the contents of the buffer before.
3858 ---------- Buffer: foo ----------
3859 @end group
3860 @end example
3861 @end defun
3862
3863 @deffn Command translate-region start end table
3864 This function applies a translation table to the characters in the
3865 buffer between positions @var{start} and @var{end}.
3866
3867 The translation table @var{table} is a string or a char-table;
3868 @code{(aref @var{table} @var{ochar})} gives the translated character
3869 corresponding to @var{ochar}. If @var{table} is a string, any
3870 characters with codes larger than the length of @var{table} are not
3871 altered by the translation.
3872
3873 The return value of @code{translate-region} is the number of
3874 characters that were actually changed by the translation. This does
3875 not count characters that were mapped into themselves in the
3876 translation table.
3877 @end deffn
3878
3879 @node Registers
3880 @section Registers
3881 @cindex registers
3882
3883 A register is a sort of variable used in Emacs editing that can hold a
3884 variety of different kinds of values. Each register is named by a
3885 single character. All @acronym{ASCII} characters and their meta variants
3886 (but with the exception of @kbd{C-g}) can be used to name registers.
3887 Thus, there are 255 possible registers. A register is designated in
3888 Emacs Lisp by the character that is its name.
3889
3890 @defvar register-alist
3891 This variable is an alist of elements of the form @code{(@var{name} .
3892 @var{contents})}. Normally, there is one element for each Emacs
3893 register that has been used.
3894
3895 The object @var{name} is a character (an integer) identifying the
3896 register.
3897 @end defvar
3898
3899 The @var{contents} of a register can have several possible types:
3900
3901 @table @asis
3902 @item a number
3903 A number stands for itself. If @code{insert-register} finds a number
3904 in the register, it converts the number to decimal.
3905
3906 @item a marker
3907 A marker represents a buffer position to jump to.
3908
3909 @item a string
3910 A string is text saved in the register.
3911
3912 @item a rectangle
3913 A rectangle is represented by a list of strings.
3914
3915 @item @code{(@var{window-configuration} @var{position})}
3916 This represents a window configuration to restore in one frame, and a
3917 position to jump to in the current buffer.
3918
3919 @item @code{(@var{frame-configuration} @var{position})}
3920 This represents a frame configuration to restore, and a position
3921 to jump to in the current buffer.
3922
3923 @item (file @var{filename})
3924 This represents a file to visit; jumping to this value visits file
3925 @var{filename}.
3926
3927 @item (file-query @var{filename} @var{position})
3928 This represents a file to visit and a position in it; jumping to this
3929 value visits file @var{filename} and goes to buffer position
3930 @var{position}. Restoring this type of position asks the user for
3931 confirmation first.
3932 @end table
3933
3934 The functions in this section return unpredictable values unless
3935 otherwise stated.
3936
3937 @defun get-register reg
3938 This function returns the contents of the register
3939 @var{reg}, or @code{nil} if it has no contents.
3940 @end defun
3941
3942 @defun set-register reg value
3943 This function sets the contents of register @var{reg} to @var{value}.
3944 A register can be set to any value, but the other register functions
3945 expect only certain data types. The return value is @var{value}.
3946 @end defun
3947
3948 @deffn Command view-register reg
3949 This command displays what is contained in register @var{reg}.
3950 @end deffn
3951
3952 @deffn Command insert-register reg &optional beforep
3953 This command inserts contents of register @var{reg} into the current
3954 buffer.
3955
3956 Normally, this command puts point before the inserted text, and the
3957 mark after it. However, if the optional second argument @var{beforep}
3958 is non-@code{nil}, it puts the mark before and point after.
3959 You can pass a non-@code{nil} second argument @var{beforep} to this
3960 function interactively by supplying any prefix argument.
3961
3962 If the register contains a rectangle, then the rectangle is inserted
3963 with its upper left corner at point. This means that text is inserted
3964 in the current line and underneath it on successive lines.
3965
3966 If the register contains something other than saved text (a string) or
3967 a rectangle (a list), currently useless things happen. This may be
3968 changed in the future.
3969 @end deffn
3970
3971 @node Transposition
3972 @section Transposition of Text
3973
3974 This subroutine is used by the transposition commands.
3975
3976 @defun transpose-regions start1 end1 start2 end2 &optional leave-markers
3977 This function exchanges two nonoverlapping portions of the buffer.
3978 Arguments @var{start1} and @var{end1} specify the bounds of one portion
3979 and arguments @var{start2} and @var{end2} specify the bounds of the
3980 other portion.
3981
3982 Normally, @code{transpose-regions} relocates markers with the transposed
3983 text; a marker previously positioned within one of the two transposed
3984 portions moves along with that portion, thus remaining between the same
3985 two characters in their new position. However, if @var{leave-markers}
3986 is non-@code{nil}, @code{transpose-regions} does not do this---it leaves
3987 all markers unrelocated.
3988 @end defun
3989
3990 @node Base 64
3991 @section Base 64 Encoding
3992 @cindex base 64 encoding
3993
3994 Base 64 code is used in email to encode a sequence of 8-bit bytes as
3995 a longer sequence of @acronym{ASCII} graphic characters. It is defined in
3996 Internet RFC@footnote{
3997 An RFC, an acronym for @dfn{Request for Comments}, is a numbered
3998 Internet informational document describing a standard. RFCs are
3999 usually written by technical experts acting on their own initiative,
4000 and are traditionally written in a pragmatic, experience-driven
4001 manner.
4002 }2045. This section describes the functions for
4003 converting to and from this code.
4004
4005 @deffn Command base64-encode-region beg end &optional no-line-break
4006 This function converts the region from @var{beg} to @var{end} into base
4007 64 code. It returns the length of the encoded text. An error is
4008 signaled if a character in the region is multibyte, i.e.@: in a
4009 multibyte buffer the region must contain only characters from the
4010 charsets @code{ascii}, @code{eight-bit-control} and
4011 @code{eight-bit-graphic}.
4012
4013 Normally, this function inserts newline characters into the encoded
4014 text, to avoid overlong lines. However, if the optional argument
4015 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4016 the output is just one long line.
4017 @end deffn
4018
4019 @deffn Command base64-encode-string string &optional no-line-break
4020 This function converts the string @var{string} into base 64 code. It
4021 returns a string containing the encoded text. As for
4022 @code{base64-encode-region}, an error is signaled if a character in the
4023 string is multibyte.
4024
4025 Normally, this function inserts newline characters into the encoded
4026 text, to avoid overlong lines. However, if the optional argument
4027 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4028 the result string is just one long line.
4029 @end deffn
4030
4031 @defun base64-decode-region beg end
4032 This function converts the region from @var{beg} to @var{end} from base
4033 64 code into the corresponding decoded text. It returns the length of
4034 the decoded text.
4035
4036 The decoding functions ignore newline characters in the encoded text.
4037 @end defun
4038
4039 @defun base64-decode-string string
4040 This function converts the string @var{string} from base 64 code into
4041 the corresponding decoded text. It returns a unibyte string containing the
4042 decoded text.
4043
4044 The decoding functions ignore newline characters in the encoded text.
4045 @end defun
4046
4047 @node MD5 Checksum
4048 @section MD5 Checksum
4049 @cindex MD5 checksum
4050 @cindex message digest computation
4051
4052 MD5 cryptographic checksums, or @dfn{message digests}, are 128-bit
4053 ``fingerprints'' of a document or program. They are used to verify
4054 that you have an exact and unaltered copy of the data. The algorithm
4055 to calculate the MD5 message digest is defined in Internet
4056 RFC@footnote{
4057 For an explanation of what is an RFC, see the footnote in @ref{Base
4058 64}.
4059 }1321. This section describes the Emacs facilities for computing
4060 message digests.
4061
4062 @defun md5 object &optional start end coding-system noerror
4063 This function returns the MD5 message digest of @var{object}, which
4064 should be a buffer or a string.
4065
4066 The two optional arguments @var{start} and @var{end} are character
4067 positions specifying the portion of @var{object} to compute the
4068 message digest for. If they are @code{nil} or omitted, the digest is
4069 computed for the whole of @var{object}.
4070
4071 The function @code{md5} does not compute the message digest directly
4072 from the internal Emacs representation of the text (@pxref{Text
4073 Representations}). Instead, it encodes the text using a coding
4074 system, and computes the message digest from the encoded text. The
4075 optional fourth argument @var{coding-system} specifies which coding
4076 system to use for encoding the text. It should be the same coding
4077 system that you used to read the text, or that you used or will use
4078 when saving or sending the text. @xref{Coding Systems}, for more
4079 information about coding systems.
4080
4081 If @var{coding-system} is @code{nil} or omitted, the default depends
4082 on @var{object}. If @var{object} is a buffer, the default for
4083 @var{coding-system} is whatever coding system would be chosen by
4084 default for writing this text into a file. If @var{object} is a
4085 string, the user's most preferred coding system (@pxref{Recognize
4086 Coding, prefer-coding-system, the description of
4087 @code{prefer-coding-system}, emacs, GNU Emacs Manual}) is used.
4088
4089 Normally, @code{md5} signals an error if the text can't be encoded
4090 using the specified or chosen coding system. However, if
4091 @var{noerror} is non-@code{nil}, it silently uses @code{raw-text}
4092 coding instead.
4093 @end defun
4094
4095 @node Parsing HTML
4096 @section Parsing HTML
4097 @cindex parsing html
4098 @cindex parsing xml
4099
4100 Emacs provides an interface to the @code{libxml2} library via two
4101 functions: @code{html-parse-buffer} and @code{xml-parse-buffer}. The
4102 HTML function will parse ``real world'' HTML and try to return a
4103 sensible parse tree, while the XML function is somewhat stricter about
4104 syntax.
4105
4106 They both take a two optional parameter. The first is a buffer, and
4107 the second is a base URL to be used to expand relative URLs in the
4108 document, if any.
4109
4110 Here's an example demonstrating the structure of the parsed data you
4111 get out. Given this HTML document:
4112
4113 @example
4114 <html><hEad></head><body width=101><div class=thing>Foo<div>Yes
4115 @end example
4116
4117 You get this parse tree:
4118
4119 @example
4120 (html
4121 (head)
4122 (body
4123 (:width . "101")
4124 (div
4125 (:class . "thing")
4126 (text . "Foo")
4127 (div
4128 (text . "Yes\n")))))
4129 @end example
4130
4131 It's a simple tree structure, where the @code{car} for each node is
4132 the name of the node, and the @code{cdr} is the value, or the list of
4133 values.
4134
4135 Attributes are coded the same way as child nodes, but with @samp{:} as
4136 the first character.
4137
4138 @node Atomic Changes
4139 @section Atomic Change Groups
4140 @cindex atomic changes
4141
4142 In data base terminology, an @dfn{atomic} change is an indivisible
4143 change---it can succeed entirely or it can fail entirely, but it
4144 cannot partly succeed. A Lisp program can make a series of changes to
4145 one or several buffers as an @dfn{atomic change group}, meaning that
4146 either the entire series of changes will be installed in their buffers
4147 or, in case of an error, none of them will be.
4148
4149 To do this for one buffer, the one already current, simply write a
4150 call to @code{atomic-change-group} around the code that makes the
4151 changes, like this:
4152
4153 @example
4154 (atomic-change-group
4155 (insert foo)
4156 (delete-region x y))
4157 @end example
4158
4159 @noindent
4160 If an error (or other nonlocal exit) occurs inside the body of
4161 @code{atomic-change-group}, it unmakes all the changes in that buffer
4162 that were during the execution of the body. This kind of change group
4163 has no effect on any other buffers---any such changes remain.
4164
4165 If you need something more sophisticated, such as to make changes in
4166 various buffers constitute one atomic group, you must directly call
4167 lower-level functions that @code{atomic-change-group} uses.
4168
4169 @defun prepare-change-group &optional buffer
4170 This function sets up a change group for buffer @var{buffer}, which
4171 defaults to the current buffer. It returns a ``handle'' that
4172 represents the change group. You must use this handle to activate the
4173 change group and subsequently to finish it.
4174 @end defun
4175
4176 To use the change group, you must @dfn{activate} it. You must do
4177 this before making any changes in the text of @var{buffer}.
4178
4179 @defun activate-change-group handle
4180 This function activates the change group that @var{handle} designates.
4181 @end defun
4182
4183 After you activate the change group, any changes you make in that
4184 buffer become part of it. Once you have made all the desired changes
4185 in the buffer, you must @dfn{finish} the change group. There are two
4186 ways to do this: you can either accept (and finalize) all the changes,
4187 or cancel them all.
4188
4189 @defun accept-change-group handle
4190 This function accepts all the changes in the change group specified by
4191 @var{handle}, making them final.
4192 @end defun
4193
4194 @defun cancel-change-group handle
4195 This function cancels and undoes all the changes in the change group
4196 specified by @var{handle}.
4197 @end defun
4198
4199 Your code should use @code{unwind-protect} to make sure the group is
4200 always finished. The call to @code{activate-change-group} should be
4201 inside the @code{unwind-protect}, in case the user types @kbd{C-g}
4202 just after it runs. (This is one reason why
4203 @code{prepare-change-group} and @code{activate-change-group} are
4204 separate functions, because normally you would call
4205 @code{prepare-change-group} before the start of that
4206 @code{unwind-protect}.) Once you finish the group, don't use the
4207 handle again---in particular, don't try to finish the same group
4208 twice.
4209
4210 To make a multibuffer change group, call @code{prepare-change-group}
4211 once for each buffer you want to cover, then use @code{nconc} to
4212 combine the returned values, like this:
4213
4214 @example
4215 (nconc (prepare-change-group buffer-1)
4216 (prepare-change-group buffer-2))
4217 @end example
4218
4219 You can then activate the multibuffer change group with a single call
4220 to @code{activate-change-group}, and finish it with a single call to
4221 @code{accept-change-group} or @code{cancel-change-group}.
4222
4223 Nested use of several change groups for the same buffer works as you
4224 would expect. Non-nested use of change groups for the same buffer
4225 will get Emacs confused, so don't let it happen; the first change
4226 group you start for any given buffer should be the last one finished.
4227
4228 @node Change Hooks
4229 @section Change Hooks
4230 @cindex change hooks
4231 @cindex hooks for text changes
4232
4233 These hook variables let you arrange to take notice of all changes in
4234 all buffers (or in a particular buffer, if you make them buffer-local).
4235 See also @ref{Special Properties}, for how to detect changes to specific
4236 parts of the text.
4237
4238 The functions you use in these hooks should save and restore the match
4239 data if they do anything that uses regular expressions; otherwise, they
4240 will interfere in bizarre ways with the editing operations that call
4241 them.
4242
4243 @defvar before-change-functions
4244 This variable holds a list of functions to call before any buffer
4245 modification. Each function gets two arguments, the beginning and end
4246 of the region that is about to change, represented as integers. The
4247 buffer that is about to change is always the current buffer.
4248 @end defvar
4249
4250 @defvar after-change-functions
4251 This variable holds a list of functions to call after any buffer
4252 modification. Each function receives three arguments: the beginning and
4253 end of the region just changed, and the length of the text that existed
4254 before the change. All three arguments are integers. The buffer that's
4255 about to change is always the current buffer.
4256
4257 The length of the old text is the difference between the buffer positions
4258 before and after that text as it was before the change. As for the
4259 changed text, its length is simply the difference between the first two
4260 arguments.
4261 @end defvar
4262
4263 Output of messages into the @samp{*Messages*} buffer does not
4264 call these functions.
4265
4266 @defmac combine-after-change-calls body@dots{}
4267 The macro executes @var{body} normally, but arranges to call the
4268 after-change functions just once for a series of several changes---if
4269 that seems safe.
4270
4271 If a program makes several text changes in the same area of the buffer,
4272 using the macro @code{combine-after-change-calls} around that part of
4273 the program can make it run considerably faster when after-change hooks
4274 are in use. When the after-change hooks are ultimately called, the
4275 arguments specify a portion of the buffer including all of the changes
4276 made within the @code{combine-after-change-calls} body.
4277
4278 @strong{Warning:} You must not alter the values of
4279 @code{after-change-functions} within
4280 the body of a @code{combine-after-change-calls} form.
4281
4282 @strong{Warning:} if the changes you combine occur in widely scattered
4283 parts of the buffer, this will still work, but it is not advisable,
4284 because it may lead to inefficient behavior for some change hook
4285 functions.
4286 @end defmac
4287
4288 @defvar first-change-hook
4289 This variable is a normal hook that is run whenever a buffer is changed
4290 that was previously in the unmodified state.
4291 @end defvar
4292
4293 @defvar inhibit-modification-hooks
4294 If this variable is non-@code{nil}, all of the change hooks are
4295 disabled; none of them run. This affects all the hook variables
4296 described above in this section, as well as the hooks attached to
4297 certain special text properties (@pxref{Special Properties}) and overlay
4298 properties (@pxref{Overlay Properties}).
4299
4300 Also, this variable is bound to non-@code{nil} while running those
4301 same hook variables, so that by default modifying the buffer from
4302 a modification hook does not cause other modification hooks to be run.
4303 If you do want modification hooks to be run in a particular piece of
4304 code that is itself run from a modification hook, then rebind locally
4305 @code{inhibit-modification-hooks} to @code{nil}.
4306 @end defvar
4307
4308