]> code.delx.au - gnu-emacs/blob - doc/lispref/text.texi
Implement copying of a buffer portion while preserving visual order.
[gnu-emacs] / doc / lispref / text.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-2014 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @node Text
6 @chapter Text
7 @cindex text
8
9 This chapter describes the functions that deal with the text in a
10 buffer. Most examine, insert, or delete text in the current buffer,
11 often operating at point or on text adjacent to point. Many are
12 interactive. All the functions that change the text provide for undoing
13 the changes (@pxref{Undo}).
14
15 Many text-related functions operate on a region of text defined by two
16 buffer positions passed in arguments named @var{start} and @var{end}.
17 These arguments should be either markers (@pxref{Markers}) or numeric
18 character positions (@pxref{Positions}). The order of these arguments
19 does not matter; it is all right for @var{start} to be the end of the
20 region and @var{end} the beginning. For example, @code{(delete-region 1
21 10)} and @code{(delete-region 10 1)} are equivalent. An
22 @code{args-out-of-range} error is signaled if either @var{start} or
23 @var{end} is outside the accessible portion of the buffer. In an
24 interactive call, point and the mark are used for these arguments.
25
26 @cindex buffer contents
27 Throughout this chapter, ``text'' refers to the characters in the
28 buffer, together with their properties (when relevant). Keep in mind
29 that point is always between two characters, and the cursor appears on
30 the character after point.
31
32 @menu
33 * Near Point:: Examining text in the vicinity of point.
34 * Buffer Contents:: Examining text in a general fashion.
35 * Comparing Text:: Comparing substrings of buffers.
36 * Insertion:: Adding new text to a buffer.
37 * Commands for Insertion:: User-level commands to insert text.
38 * Deletion:: Removing text from a buffer.
39 * User-Level Deletion:: User-level commands to delete text.
40 * The Kill Ring:: Where removed text sometimes is saved for later use.
41 * Undo:: Undoing changes to the text of a buffer.
42 * Maintaining Undo:: How to enable and disable undo information.
43 How to control how much information is kept.
44 * Filling:: Functions for explicit filling.
45 * Margins:: How to specify margins for filling commands.
46 * Adaptive Fill:: Adaptive Fill mode chooses a fill prefix from context.
47 * Auto Filling:: How auto-fill mode is implemented to break lines.
48 * Sorting:: Functions for sorting parts of the buffer.
49 * Columns:: Computing horizontal positions, and using them.
50 * Indentation:: Functions to insert or adjust indentation.
51 * Case Changes:: Case conversion of parts of the buffer.
52 * Text Properties:: Assigning Lisp property lists to text characters.
53 * Substitution:: Replacing a given character wherever it appears.
54 * Registers:: How registers are implemented. Accessing the text or
55 position stored in a register.
56 * Transposition:: Swapping two portions of a buffer.
57 * Decompression:: Dealing with compressed data.
58 * Base 64:: Conversion to or from base 64 encoding.
59 * Checksum/Hash:: Computing cryptographic hashes.
60 * Parsing HTML/XML:: Parsing HTML and XML.
61 * Atomic Changes:: Installing several buffer changes "atomically".
62 * Change Hooks:: Supplying functions to be run when text is changed.
63 @end menu
64
65 @node Near Point
66 @section Examining Text Near Point
67 @cindex text near point
68
69 Many functions are provided to look at the characters around point.
70 Several simple functions are described here. See also @code{looking-at}
71 in @ref{Regexp Search}.
72
73 In the following four functions, ``beginning'' or ``end'' of buffer
74 refers to the beginning or end of the accessible portion.
75
76 @defun char-after &optional position
77 This function returns the character in the current buffer at (i.e.,
78 immediately after) position @var{position}. If @var{position} is out of
79 range for this purpose, either before the beginning of the buffer, or at
80 or beyond the end, then the value is @code{nil}. The default for
81 @var{position} is point.
82
83 In the following example, assume that the first character in the
84 buffer is @samp{@@}:
85
86 @example
87 @group
88 (string (char-after 1))
89 @result{} "@@"
90 @end group
91 @end example
92 @end defun
93
94 @defun char-before &optional position
95 This function returns the character in the current buffer immediately
96 before position @var{position}. If @var{position} is out of range for
97 this purpose, either at or before the beginning of the buffer, or beyond
98 the end, then the value is @code{nil}. The default for
99 @var{position} is point.
100 @end defun
101
102 @defun following-char
103 This function returns the character following point in the current
104 buffer. This is similar to @code{(char-after (point))}. However, if
105 point is at the end of the buffer, then @code{following-char} returns 0.
106
107 Remember that point is always between characters, and the cursor
108 normally appears over the character following point. Therefore, the
109 character returned by @code{following-char} is the character the
110 cursor is over.
111
112 In this example, point is between the @samp{a} and the @samp{c}.
113
114 @example
115 @group
116 ---------- Buffer: foo ----------
117 Gentlemen may cry ``Pea@point{}ce! Peace!,''
118 but there is no peace.
119 ---------- Buffer: foo ----------
120 @end group
121
122 @group
123 (string (preceding-char))
124 @result{} "a"
125 (string (following-char))
126 @result{} "c"
127 @end group
128 @end example
129 @end defun
130
131 @defun preceding-char
132 This function returns the character preceding point in the current
133 buffer. See above, under @code{following-char}, for an example. If
134 point is at the beginning of the buffer, @code{preceding-char} returns
135 0.
136 @end defun
137
138 @defun bobp
139 This function returns @code{t} if point is at the beginning of the
140 buffer. If narrowing is in effect, this means the beginning of the
141 accessible portion of the text. See also @code{point-min} in
142 @ref{Point}.
143 @end defun
144
145 @defun eobp
146 This function returns @code{t} if point is at the end of the buffer.
147 If narrowing is in effect, this means the end of accessible portion of
148 the text. See also @code{point-max} in @xref{Point}.
149 @end defun
150
151 @defun bolp
152 This function returns @code{t} if point is at the beginning of a line.
153 @xref{Text Lines}. The beginning of the buffer (or of its accessible
154 portion) always counts as the beginning of a line.
155 @end defun
156
157 @defun eolp
158 This function returns @code{t} if point is at the end of a line. The
159 end of the buffer (or of its accessible portion) is always considered
160 the end of a line.
161 @end defun
162
163 @node Buffer Contents
164 @section Examining Buffer Contents
165
166 This section describes functions that allow a Lisp program to
167 convert any portion of the text in the buffer into a string.
168
169 @defun buffer-substring start end
170 This function returns a string containing a copy of the text of the
171 region defined by positions @var{start} and @var{end} in the current
172 buffer. If the arguments are not positions in the accessible portion
173 of the buffer, @code{buffer-substring} signals an
174 @code{args-out-of-range} error.
175
176 Here's an example which assumes Font-Lock mode is not enabled:
177
178 @example
179 @group
180 ---------- Buffer: foo ----------
181 This is the contents of buffer foo
182
183 ---------- Buffer: foo ----------
184 @end group
185
186 @group
187 (buffer-substring 1 10)
188 @result{} "This is t"
189 @end group
190 @group
191 (buffer-substring (point-max) 10)
192 @result{} "he contents of buffer foo\n"
193 @end group
194 @end example
195
196 If the text being copied has any text properties, these are copied into
197 the string along with the characters they belong to. @xref{Text
198 Properties}. However, overlays (@pxref{Overlays}) in the buffer and
199 their properties are ignored, not copied.
200
201 For example, if Font-Lock mode is enabled, you might get results like
202 these:
203
204 @example
205 @group
206 (buffer-substring 1 10)
207 @result{} #("This is t" 0 1 (fontified t) 1 9 (fontified t))
208 @end group
209 @end example
210 @end defun
211
212 @defun buffer-substring-no-properties start end
213 This is like @code{buffer-substring}, except that it does not copy text
214 properties, just the characters themselves. @xref{Text Properties}.
215 @end defun
216
217 @defun buffer-string
218 This function returns the contents of the entire accessible portion of
219 the current buffer, as a string.
220 @end defun
221
222 If you need to make sure the resulting string, when copied to a
223 different location, will not change its visual appearance due to
224 reordering of bidirectional text, use the
225 @code{buffer-substring-with-bidi-context} function
226 (@pxref{Bidirectional Display, buffer-substring-with-bidi-context}).
227
228 @defun filter-buffer-substring start end &optional delete
229 This function filters the buffer text between @var{start} and @var{end}
230 using a function specified by the variable
231 @code{filter-buffer-substring-function}, and returns the result.
232
233 The default filter function consults the obsolete wrapper hook
234 @code{filter-buffer-substring-functions}, and the obsolete variable
235 @code{buffer-substring-filters}. If both of these are @code{nil}, it
236 returns the unaltered text from the buffer, i.e., what
237 @code{buffer-substring} would return.
238
239 If @var{delete} is non-@code{nil}, the function deletes the text
240 between @var{start} and @var{end} after copying it, like
241 @code{delete-and-extract-region}.
242
243 Lisp code should use this function instead of @code{buffer-substring},
244 @code{buffer-substring-no-properties},
245 or @code{delete-and-extract-region} when copying into user-accessible
246 data structures such as the kill-ring, X clipboard, and registers.
247 Major and minor modes can modify @code{filter-buffer-substring-function}
248 to alter such text as it is copied out of the buffer.
249 @end defun
250
251 @defvar filter-buffer-substring-function
252 The value of this variable is a function that @code{filter-buffer-substring}
253 will call to do the actual work. The function receives three
254 arguments, the same as those of @code{filter-buffer-substring},
255 which it should treat as per the documentation of that function. It
256 should return the filtered text (and optionally delete the source text).
257 @end defvar
258
259 @noindent The following two variables are obsoleted by
260 @code{filter-buffer-substring-function}, but are still supported for
261 backward compatibility.
262
263 @defvar filter-buffer-substring-functions
264 This obsolete variable is a wrapper hook, whose members should be functions
265 that accept four arguments: @var{fun}, @var{start}, @var{end}, and
266 @var{delete}. @var{fun} is a function that takes three arguments
267 (@var{start}, @var{end}, and @var{delete}), and returns a string. In
268 both cases, the @var{start}, @var{end}, and @var{delete} arguments are
269 the same as those of @code{filter-buffer-substring}.
270
271 The first hook function is passed a @var{fun} that is equivalent to
272 the default operation of @code{filter-buffer-substring}, i.e., it
273 returns the buffer-substring between @var{start} and @var{end}
274 (processed by any @code{buffer-substring-filters}) and optionally
275 deletes the original text from the buffer. In most cases, the hook
276 function will call @var{fun} once, and then do its own processing of
277 the result. The next hook function receives a @var{fun} equivalent to
278 this, and so on. The actual return value is the result of all the
279 hook functions acting in sequence.
280 @end defvar
281
282 @defvar buffer-substring-filters
283 The value of this obsolete variable should be a list of functions
284 that accept a single string argument and return another string.
285 The default @code{filter-buffer-substring} function passes the buffer
286 substring to the first function in this list, and the return value of
287 each function is passed to the next function. The return value of the
288 last function is passed to @code{filter-buffer-substring-functions}.
289 @end defvar
290
291 @defun current-word &optional strict really-word
292 This function returns the symbol (or word) at or near point, as a
293 string. The return value includes no text properties.
294
295 If the optional argument @var{really-word} is non-@code{nil}, it finds a
296 word; otherwise, it finds a symbol (which includes both word
297 characters and symbol constituent characters).
298
299 If the optional argument @var{strict} is non-@code{nil}, then point
300 must be in or next to the symbol or word---if no symbol or word is
301 there, the function returns @code{nil}. Otherwise, a nearby symbol or
302 word on the same line is acceptable.
303 @end defun
304
305 @defun thing-at-point thing
306 Return the @var{thing} around or next to point, as a string.
307
308 The argument @var{thing} is a symbol which specifies a kind of syntactic
309 entity. Possibilities include @code{symbol}, @code{list}, @code{sexp},
310 @code{defun}, @code{filename}, @code{url}, @code{word}, @code{sentence},
311 @code{whitespace}, @code{line}, @code{page}, and others.
312
313 @example
314 ---------- Buffer: foo ----------
315 Gentlemen may cry ``Pea@point{}ce! Peace!,''
316 but there is no peace.
317 ---------- Buffer: foo ----------
318
319 (thing-at-point 'word)
320 @result{} "Peace"
321 (thing-at-point 'line)
322 @result{} "Gentlemen may cry ``Peace! Peace!,''\n"
323 (thing-at-point 'whitespace)
324 @result{} nil
325 @end example
326 @end defun
327
328 @node Comparing Text
329 @section Comparing Text
330 @cindex comparing buffer text
331
332 This function lets you compare portions of the text in a buffer, without
333 copying them into strings first.
334
335 @defun compare-buffer-substrings buffer1 start1 end1 buffer2 start2 end2
336 This function lets you compare two substrings of the same buffer or two
337 different buffers. The first three arguments specify one substring,
338 giving a buffer (or a buffer name) and two positions within the
339 buffer. The last three arguments specify the other substring in the
340 same way. You can use @code{nil} for @var{buffer1}, @var{buffer2}, or
341 both to stand for the current buffer.
342
343 The value is negative if the first substring is less, positive if the
344 first is greater, and zero if they are equal. The absolute value of
345 the result is one plus the index of the first differing characters
346 within the substrings.
347
348 This function ignores case when comparing characters
349 if @code{case-fold-search} is non-@code{nil}. It always ignores
350 text properties.
351
352 Suppose the current buffer contains the text @samp{foobarbar
353 haha!rara!}; then in this example the two substrings are @samp{rbar }
354 and @samp{rara!}. The value is 2 because the first substring is greater
355 at the second character.
356
357 @example
358 (compare-buffer-substrings nil 6 11 nil 16 21)
359 @result{} 2
360 @end example
361 @end defun
362
363 @node Insertion
364 @section Inserting Text
365 @cindex insertion of text
366 @cindex text insertion
367
368 @cindex insertion before point
369 @cindex before point, insertion
370 @dfn{Insertion} means adding new text to a buffer. The inserted text
371 goes at point---between the character before point and the character
372 after point. Some insertion functions leave point before the inserted
373 text, while other functions leave it after. We call the former
374 insertion @dfn{after point} and the latter insertion @dfn{before point}.
375
376 Insertion moves markers located at positions after the insertion
377 point, so that they stay with the surrounding text (@pxref{Markers}).
378 When a marker points at the place of insertion, insertion may or may
379 not relocate the marker, depending on the marker's insertion type
380 (@pxref{Marker Insertion Types}). Certain special functions such as
381 @code{insert-before-markers} relocate all such markers to point after
382 the inserted text, regardless of the markers' insertion type.
383
384 Insertion functions signal an error if the current buffer is
385 read-only (@pxref{Read Only Buffers}) or if they insert within
386 read-only text (@pxref{Special Properties}).
387
388 These functions copy text characters from strings and buffers along
389 with their properties. The inserted characters have exactly the same
390 properties as the characters they were copied from. By contrast,
391 characters specified as separate arguments, not part of a string or
392 buffer, inherit their text properties from the neighboring text.
393
394 The insertion functions convert text from unibyte to multibyte in
395 order to insert in a multibyte buffer, and vice versa---if the text
396 comes from a string or from a buffer. However, they do not convert
397 unibyte character codes 128 through 255 to multibyte characters, not
398 even if the current buffer is a multibyte buffer. @xref{Converting
399 Representations}.
400
401 @defun insert &rest args
402 This function inserts the strings and/or characters @var{args} into the
403 current buffer, at point, moving point forward. In other words, it
404 inserts the text before point. An error is signaled unless all
405 @var{args} are either strings or characters. The value is @code{nil}.
406 @end defun
407
408 @defun insert-before-markers &rest args
409 This function inserts the strings and/or characters @var{args} into the
410 current buffer, at point, moving point forward. An error is signaled
411 unless all @var{args} are either strings or characters. The value is
412 @code{nil}.
413
414 This function is unlike the other insertion functions in that it
415 relocates markers initially pointing at the insertion point, to point
416 after the inserted text. If an overlay begins at the insertion point,
417 the inserted text falls outside the overlay; if a nonempty overlay
418 ends at the insertion point, the inserted text falls inside that
419 overlay.
420 @end defun
421
422 @deffn Command insert-char character &optional count inherit
423 This command inserts @var{count} instances of @var{character} into the
424 current buffer before point. The argument @var{count} must be an
425 integer, and @var{character} must be a character.
426
427 If called interactively, this command prompts for @var{character}
428 using its Unicode name or its code point. @xref{Inserting Text,,,
429 emacs, The GNU Emacs Manual}.
430
431 This function does not convert unibyte character codes 128 through 255
432 to multibyte characters, not even if the current buffer is a multibyte
433 buffer. @xref{Converting Representations}.
434
435 If @var{inherit} is non-@code{nil}, the inserted characters inherit
436 sticky text properties from the two characters before and after the
437 insertion point. @xref{Sticky Properties}.
438 @end deffn
439
440 @defun insert-buffer-substring from-buffer-or-name &optional start end
441 This function inserts a portion of buffer @var{from-buffer-or-name}
442 into the current buffer before point. The text inserted is the region
443 between @var{start} (inclusive) and @var{end} (exclusive). (These
444 arguments default to the beginning and end of the accessible portion
445 of that buffer.) This function returns @code{nil}.
446
447 In this example, the form is executed with buffer @samp{bar} as the
448 current buffer. We assume that buffer @samp{bar} is initially empty.
449
450 @example
451 @group
452 ---------- Buffer: foo ----------
453 We hold these truths to be self-evident, that all
454 ---------- Buffer: foo ----------
455 @end group
456
457 @group
458 (insert-buffer-substring "foo" 1 20)
459 @result{} nil
460
461 ---------- Buffer: bar ----------
462 We hold these truth@point{}
463 ---------- Buffer: bar ----------
464 @end group
465 @end example
466 @end defun
467
468 @defun insert-buffer-substring-no-properties from-buffer-or-name &optional start end
469 This is like @code{insert-buffer-substring} except that it does not
470 copy any text properties.
471 @end defun
472
473 @xref{Sticky Properties}, for other insertion functions that inherit
474 text properties from the nearby text in addition to inserting it.
475 Whitespace inserted by indentation functions also inherits text
476 properties.
477
478 @node Commands for Insertion
479 @section User-Level Insertion Commands
480
481 This section describes higher-level commands for inserting text,
482 commands intended primarily for the user but useful also in Lisp
483 programs.
484
485 @deffn Command insert-buffer from-buffer-or-name
486 This command inserts the entire accessible contents of
487 @var{from-buffer-or-name} (which must exist) into the current buffer
488 after point. It leaves the mark after the inserted text. The value
489 is @code{nil}.
490 @end deffn
491
492 @deffn Command self-insert-command count
493 @cindex character insertion
494 @cindex self-insertion
495 This command inserts the last character typed; it does so @var{count}
496 times, before point, and returns @code{nil}. Most printing characters
497 are bound to this command. In routine use, @code{self-insert-command}
498 is the most frequently called function in Emacs, but programs rarely use
499 it except to install it on a keymap.
500
501 In an interactive call, @var{count} is the numeric prefix argument.
502
503 @c FIXME: This variable is obsolete since 23.1.
504 Self-insertion translates the input character through
505 @code{translation-table-for-input}. @xref{Translation of Characters}.
506
507 This command calls @code{auto-fill-function} whenever that is
508 non-@code{nil} and the character inserted is in the table
509 @code{auto-fill-chars} (@pxref{Auto Filling}).
510
511 @c Cross refs reworded to prevent overfull hbox. --rjc 15mar92
512 This command performs abbrev expansion if Abbrev mode is enabled and
513 the inserted character does not have word-constituent
514 syntax. (@xref{Abbrevs}, and @ref{Syntax Class Table}.) It is also
515 responsible for calling @code{blink-paren-function} when the inserted
516 character has close parenthesis syntax (@pxref{Blinking}).
517
518 @vindex post-self-insert-hook
519 The final thing this command does is to run the hook
520 @code{post-self-insert-hook}. You could use this to automatically
521 reindent text as it is typed, for example.
522
523 Do not try substituting your own definition of
524 @code{self-insert-command} for the standard one. The editor command
525 loop handles this function specially.
526 @end deffn
527
528 @deffn Command newline &optional number-of-newlines
529 This command inserts newlines into the current buffer before point.
530 If @var{number-of-newlines} is supplied, that many newline characters
531 are inserted.
532
533 @cindex newline and Auto Fill mode
534 This function calls @code{auto-fill-function} if the current column
535 number is greater than the value of @code{fill-column} and
536 @var{number-of-newlines} is @code{nil}. Typically what
537 @code{auto-fill-function} does is insert a newline; thus, the overall
538 result in this case is to insert two newlines at different places: one
539 at point, and another earlier in the line. @code{newline} does not
540 auto-fill if @var{number-of-newlines} is non-@code{nil}.
541
542 This command indents to the left margin if that is not zero.
543 @xref{Margins}.
544
545 The value returned is @code{nil}. In an interactive call, @var{count}
546 is the numeric prefix argument.
547 @end deffn
548
549 @defvar overwrite-mode
550 This variable controls whether overwrite mode is in effect. The value
551 should be @code{overwrite-mode-textual}, @code{overwrite-mode-binary},
552 or @code{nil}. @code{overwrite-mode-textual} specifies textual
553 overwrite mode (treats newlines and tabs specially), and
554 @code{overwrite-mode-binary} specifies binary overwrite mode (treats
555 newlines and tabs like any other characters).
556 @end defvar
557
558 @node Deletion
559 @section Deleting Text
560 @cindex text deletion
561
562 @cindex deleting text vs killing
563 Deletion means removing part of the text in a buffer, without saving
564 it in the kill ring (@pxref{The Kill Ring}). Deleted text can't be
565 yanked, but can be reinserted using the undo mechanism (@pxref{Undo}).
566 Some deletion functions do save text in the kill ring in some special
567 cases.
568
569 All of the deletion functions operate on the current buffer.
570
571 @deffn Command erase-buffer
572 This function deletes the entire text of the current buffer
573 (@emph{not} just the accessible portion), leaving it
574 empty. If the buffer is read-only, it signals a @code{buffer-read-only}
575 error; if some of the text in it is read-only, it signals a
576 @code{text-read-only} error. Otherwise, it deletes the text without
577 asking for any confirmation. It returns @code{nil}.
578
579 Normally, deleting a large amount of text from a buffer inhibits further
580 auto-saving of that buffer ``because it has shrunk''. However,
581 @code{erase-buffer} does not do this, the idea being that the future
582 text is not really related to the former text, and its size should not
583 be compared with that of the former text.
584 @end deffn
585
586 @deffn Command delete-region start end
587 This command deletes the text between positions @var{start} and
588 @var{end} in the current buffer, and returns @code{nil}. If point was
589 inside the deleted region, its value afterward is @var{start}.
590 Otherwise, point relocates with the surrounding text, as markers do.
591 @end deffn
592
593 @defun delete-and-extract-region start end
594 This function deletes the text between positions @var{start} and
595 @var{end} in the current buffer, and returns a string containing the
596 text just deleted.
597
598 If point was inside the deleted region, its value afterward is
599 @var{start}. Otherwise, point relocates with the surrounding text, as
600 markers do.
601 @end defun
602
603 @deffn Command delete-char count &optional killp
604 This command deletes @var{count} characters directly after point, or
605 before point if @var{count} is negative. If @var{killp} is
606 non-@code{nil}, then it saves the deleted characters in the kill ring.
607
608 In an interactive call, @var{count} is the numeric prefix argument, and
609 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
610 argument is supplied, the text is saved in the kill ring. If no prefix
611 argument is supplied, then one character is deleted, but not saved in
612 the kill ring.
613
614 The value returned is always @code{nil}.
615 @end deffn
616
617 @deffn Command delete-backward-char count &optional killp
618 @cindex deleting previous char
619 This command deletes @var{count} characters directly before point, or
620 after point if @var{count} is negative. If @var{killp} is
621 non-@code{nil}, then it saves the deleted characters in the kill ring.
622
623 In an interactive call, @var{count} is the numeric prefix argument, and
624 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
625 argument is supplied, the text is saved in the kill ring. If no prefix
626 argument is supplied, then one character is deleted, but not saved in
627 the kill ring.
628
629 The value returned is always @code{nil}.
630 @end deffn
631
632 @deffn Command backward-delete-char-untabify count &optional killp
633 @cindex tab deletion
634 This command deletes @var{count} characters backward, changing tabs
635 into spaces. When the next character to be deleted is a tab, it is
636 first replaced with the proper number of spaces to preserve alignment
637 and then one of those spaces is deleted instead of the tab. If
638 @var{killp} is non-@code{nil}, then the command saves the deleted
639 characters in the kill ring.
640
641 Conversion of tabs to spaces happens only if @var{count} is positive.
642 If it is negative, exactly @minus{}@var{count} characters after point
643 are deleted.
644
645 In an interactive call, @var{count} is the numeric prefix argument, and
646 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
647 argument is supplied, the text is saved in the kill ring. If no prefix
648 argument is supplied, then one character is deleted, but not saved in
649 the kill ring.
650
651 The value returned is always @code{nil}.
652 @end deffn
653
654 @defopt backward-delete-char-untabify-method
655 This option specifies how @code{backward-delete-char-untabify} should
656 deal with whitespace. Possible values include @code{untabify}, the
657 default, meaning convert a tab to many spaces and delete one;
658 @code{hungry}, meaning delete all tabs and spaces before point with
659 one command; @code{all} meaning delete all tabs, spaces and newlines
660 before point, and @code{nil}, meaning do nothing special for
661 whitespace characters.
662 @end defopt
663
664 @node User-Level Deletion
665 @section User-Level Deletion Commands
666
667 This section describes higher-level commands for deleting text,
668 commands intended primarily for the user but useful also in Lisp
669 programs.
670
671 @deffn Command delete-horizontal-space &optional backward-only
672 @cindex deleting whitespace
673 This function deletes all spaces and tabs around point. It returns
674 @code{nil}.
675
676 If @var{backward-only} is non-@code{nil}, the function deletes
677 spaces and tabs before point, but not after point.
678
679 In the following examples, we call @code{delete-horizontal-space} four
680 times, once on each line, with point between the second and third
681 characters on the line each time.
682
683 @example
684 @group
685 ---------- Buffer: foo ----------
686 I @point{}thought
687 I @point{} thought
688 We@point{} thought
689 Yo@point{}u thought
690 ---------- Buffer: foo ----------
691 @end group
692
693 @group
694 (delete-horizontal-space) ; @r{Four times.}
695 @result{} nil
696
697 ---------- Buffer: foo ----------
698 Ithought
699 Ithought
700 Wethought
701 You thought
702 ---------- Buffer: foo ----------
703 @end group
704 @end example
705 @end deffn
706
707 @deffn Command delete-indentation &optional join-following-p
708 This function joins the line point is on to the previous line, deleting
709 any whitespace at the join and in some cases replacing it with one
710 space. If @var{join-following-p} is non-@code{nil},
711 @code{delete-indentation} joins this line to the following line
712 instead. The function returns @code{nil}.
713
714 If there is a fill prefix, and the second of the lines being joined
715 starts with the prefix, then @code{delete-indentation} deletes the
716 fill prefix before joining the lines. @xref{Margins}.
717
718 In the example below, point is located on the line starting
719 @samp{events}, and it makes no difference if there are trailing spaces
720 in the preceding line.
721
722 @smallexample
723 @group
724 ---------- Buffer: foo ----------
725 When in the course of human
726 @point{} events, it becomes necessary
727 ---------- Buffer: foo ----------
728 @end group
729
730 (delete-indentation)
731 @result{} nil
732
733 @group
734 ---------- Buffer: foo ----------
735 When in the course of human@point{} events, it becomes necessary
736 ---------- Buffer: foo ----------
737 @end group
738 @end smallexample
739
740 After the lines are joined, the function @code{fixup-whitespace} is
741 responsible for deciding whether to leave a space at the junction.
742 @end deffn
743
744 @deffn Command fixup-whitespace
745 This function replaces all the horizontal whitespace surrounding point
746 with either one space or no space, according to the context. It
747 returns @code{nil}.
748
749 At the beginning or end of a line, the appropriate amount of space is
750 none. Before a character with close parenthesis syntax, or after a
751 character with open parenthesis or expression-prefix syntax, no space is
752 also appropriate. Otherwise, one space is appropriate. @xref{Syntax
753 Class Table}.
754
755 In the example below, @code{fixup-whitespace} is called the first time
756 with point before the word @samp{spaces} in the first line. For the
757 second invocation, point is directly after the @samp{(}.
758
759 @smallexample
760 @group
761 ---------- Buffer: foo ----------
762 This has too many @point{}spaces
763 This has too many spaces at the start of (@point{} this list)
764 ---------- Buffer: foo ----------
765 @end group
766
767 @group
768 (fixup-whitespace)
769 @result{} nil
770 (fixup-whitespace)
771 @result{} nil
772 @end group
773
774 @group
775 ---------- Buffer: foo ----------
776 This has too many spaces
777 This has too many spaces at the start of (this list)
778 ---------- Buffer: foo ----------
779 @end group
780 @end smallexample
781 @end deffn
782
783 @deffn Command just-one-space &optional n
784 @comment !!SourceFile simple.el
785 This command replaces any spaces and tabs around point with a single
786 space, or @var{n} spaces if @var{n} is specified. It returns
787 @code{nil}.
788 @end deffn
789
790 @c There is also cycle-spacing, but I cannot see it being useful in
791 @c Lisp programs, so it is not mentioned here.
792
793 @deffn Command delete-blank-lines
794 This function deletes blank lines surrounding point. If point is on a
795 blank line with one or more blank lines before or after it, then all but
796 one of them are deleted. If point is on an isolated blank line, then it
797 is deleted. If point is on a nonblank line, the command deletes all
798 blank lines immediately following it.
799
800 A blank line is defined as a line containing only tabs and spaces.
801 @c and the Newline character?
802
803 @code{delete-blank-lines} returns @code{nil}.
804 @end deffn
805
806 @deffn Command delete-trailing-whitespace start end
807 Delete trailing whitespace in the region defined by @var{start} and
808 @var{end}.
809
810 This command deletes whitespace characters after the last
811 non-whitespace character in each line in the region.
812
813 If this command acts on the entire buffer (i.e. if called
814 interactively with the mark inactive, or called from Lisp with
815 @var{end} @code{nil}), it also deletes all trailing lines at the end of the
816 buffer if the variable @code{delete-trailing-lines} is non-@code{nil}.
817 @end deffn
818
819 @node The Kill Ring
820 @section The Kill Ring
821 @cindex kill ring
822
823 @dfn{Kill functions} delete text like the deletion functions, but save
824 it so that the user can reinsert it by @dfn{yanking}. Most of these
825 functions have @samp{kill-} in their name. By contrast, the functions
826 whose names start with @samp{delete-} normally do not save text for
827 yanking (though they can still be undone); these are ``deletion''
828 functions.
829
830 Most of the kill commands are primarily for interactive use, and are
831 not described here. What we do describe are the functions provided for
832 use in writing such commands. You can use these functions to write
833 commands for killing text. When you need to delete text for internal
834 purposes within a Lisp function, you should normally use deletion
835 functions, so as not to disturb the kill ring contents.
836 @xref{Deletion}.
837
838 Killed text is saved for later yanking in the @dfn{kill ring}. This
839 is a list that holds a number of recent kills, not just the last text
840 kill. We call this a ``ring'' because yanking treats it as having
841 elements in a cyclic order. The list is kept in the variable
842 @code{kill-ring}, and can be operated on with the usual functions for
843 lists; there are also specialized functions, described in this section,
844 that treat it as a ring.
845
846 Some people think this use of the word ``kill'' is unfortunate, since
847 it refers to operations that specifically @emph{do not} destroy the
848 entities ``killed''. This is in sharp contrast to ordinary life, in
849 which death is permanent and ``killed'' entities do not come back to
850 life. Therefore, other metaphors have been proposed. For example, the
851 term ``cut ring'' makes sense to people who, in pre-computer days, used
852 scissors and paste to cut up and rearrange manuscripts. However, it
853 would be difficult to change the terminology now.
854
855 @menu
856 * Kill Ring Concepts:: What text looks like in the kill ring.
857 * Kill Functions:: Functions that kill text.
858 * Yanking:: How yanking is done.
859 * Yank Commands:: Commands that access the kill ring.
860 * Low-Level Kill Ring:: Functions and variables for kill ring access.
861 * Internals of Kill Ring:: Variables that hold kill ring data.
862 @end menu
863
864 @node Kill Ring Concepts
865 @subsection Kill Ring Concepts
866
867 The kill ring records killed text as strings in a list, most recent
868 first. A short kill ring, for example, might look like this:
869
870 @example
871 ("some text" "a different piece of text" "even older text")
872 @end example
873
874 @noindent
875 When the list reaches @code{kill-ring-max} entries in length, adding a
876 new entry automatically deletes the last entry.
877
878 When kill commands are interwoven with other commands, each kill
879 command makes a new entry in the kill ring. Multiple kill commands in
880 succession build up a single kill ring entry, which would be yanked as a
881 unit; the second and subsequent consecutive kill commands add text to
882 the entry made by the first one.
883
884 For yanking, one entry in the kill ring is designated the ``front'' of
885 the ring. Some yank commands ``rotate'' the ring by designating a
886 different element as the ``front''. But this virtual rotation doesn't
887 change the list itself---the most recent entry always comes first in the
888 list.
889
890 @node Kill Functions
891 @subsection Functions for Killing
892
893 @code{kill-region} is the usual subroutine for killing text. Any
894 command that calls this function is a ``kill command'' (and should
895 probably have @samp{kill} in its name). @code{kill-region} puts the
896 newly killed text in a new element at the beginning of the kill ring or
897 adds it to the most recent element. It determines automatically (using
898 @code{last-command}) whether the previous command was a kill command,
899 and if so appends the killed text to the most recent entry.
900
901 @deffn Command kill-region start end
902 This function kills the text in the region defined by @var{start} and
903 @var{end}. The text is deleted but saved in the kill ring, along with
904 its text properties. The value is always @code{nil}.
905
906 In an interactive call, @var{start} and @var{end} are point and
907 the mark.
908
909 If the buffer or text is read-only, @code{kill-region} modifies the kill
910 ring just the same, then signals an error without modifying the buffer.
911 This is convenient because it lets the user use a series of kill
912 commands to copy text from a read-only buffer into the kill ring.
913 @end deffn
914
915 @defopt kill-read-only-ok
916 If this option is non-@code{nil}, @code{kill-region} does not signal an
917 error if the buffer or text is read-only. Instead, it simply returns,
918 updating the kill ring but not changing the buffer.
919 @end defopt
920
921 @deffn Command copy-region-as-kill start end
922 This command saves the region defined by @var{start} and @var{end} on
923 the kill ring (including text properties), but does not delete the text
924 from the buffer. It returns @code{nil}.
925
926 The command does not set @code{this-command} to @code{kill-region}, so a
927 subsequent kill command does not append to the same kill ring entry.
928
929 @c FIXME Why is it better? Why isn't copy-region-as-kill obsolete then?
930 @c Why is it used in many places in Emacs?
931 In Lisp programs, it is better to use @code{kill-new} or
932 @code{kill-append} instead of this command. @xref{Low-Level Kill Ring}.
933 @end deffn
934
935 @node Yanking
936 @subsection Yanking
937
938 Yanking means inserting text from the kill ring, but it does not
939 insert the text blindly. The @code{yank} command, and related
940 commands, use @code{insert-for-yank} to perform special processing on
941 the text before it is inserted.
942
943 @defun insert-for-yank string
944 This function works like @code{insert}, except that it processes the
945 text in @var{string} according to the @code{yank-handler} text
946 property, as well as the variables @code{yank-handled-properties} and
947 @code{yank-excluded-properties} (see below), before inserting the
948 result into the current buffer.
949 @end defun
950
951 @defun insert-buffer-substring-as-yank buf &optional start end
952 This function resembles @code{insert-buffer-substring}, except that it
953 processes the text according to @code{yank-handled-properties} and
954 @code{yank-excluded-properties}. (It does not handle the
955 @code{yank-handler} property, which does not normally occur in buffer
956 text anyway.)
957 @end defun
958
959 @c FIXME: Add an index for yank-handler.
960 If you put a @code{yank-handler} text property on all or part of a
961 string, that alters how @code{insert-for-yank} inserts the string. If
962 different parts of the string have different @code{yank-handler}
963 values (comparison being done with @code{eq}), each substring is
964 handled separately. The property value must be a list of one to four
965 elements, with the following format (where elements after the first
966 may be omitted):
967
968 @example
969 (@var{function} @var{param} @var{noexclude} @var{undo})
970 @end example
971
972 Here is what the elements do:
973
974 @table @var
975 @item function
976 When @var{function} is non-@code{nil}, it is called instead of
977 @code{insert} to insert the string, with one argument---the string to
978 insert.
979
980 @item param
981 If @var{param} is present and non-@code{nil}, it replaces @var{string}
982 (or the substring of @var{string} being processed) as the object
983 passed to @var{function} (or @code{insert}). For example, if
984 @var{function} is @code{yank-rectangle}, @var{param} should be a list
985 of strings to insert as a rectangle.
986
987 @item noexclude
988 If @var{noexclude} is present and non-@code{nil}, that disables the
989 normal action of @code{yank-handled-properties} and
990 @code{yank-excluded-properties} on the inserted string.
991
992 @item undo
993 If @var{undo} is present and non-@code{nil}, it is a function that will be
994 called by @code{yank-pop} to undo the insertion of the current object.
995 It is called with two arguments, the start and end of the current
996 region. @var{function} can set @code{yank-undo-function} to override
997 the @var{undo} value.
998 @end table
999
1000 @cindex yanking and text properties
1001 @defopt yank-handled-properties
1002 This variable specifies special text property handling conditions for
1003 yanked text. It takes effect after the text has been inserted (either
1004 normally, or via the @code{yank-handler} property), and prior to
1005 @code{yank-excluded-properties} taking effect.
1006
1007 The value should be an alist of elements @code{(@var{prop}
1008 . @var{fun})}. Each alist element is handled in order. The inserted
1009 text is scanned for stretches of text having text properties @code{eq}
1010 to @var{prop}; for each such stretch, @var{fun} is called with three
1011 arguments: the value of the property, and the start and end positions
1012 of the text.
1013 @end defopt
1014
1015 @defopt yank-excluded-properties
1016 The value of this variable is the list of properties to remove from
1017 inserted text. Its default value contains properties that might lead
1018 to annoying results, such as causing the text to respond to the mouse
1019 or specifying key bindings. It takes effect after
1020 @code{yank-handled-properties}.
1021 @end defopt
1022
1023
1024 @node Yank Commands
1025 @subsection Functions for Yanking
1026
1027 This section describes higher-level commands for yanking, which are
1028 intended primarily for the user but useful also in Lisp programs.
1029 Both @code{yank} and @code{yank-pop} honor the
1030 @code{yank-excluded-properties} variable and @code{yank-handler} text
1031 property (@pxref{Yanking}).
1032
1033 @deffn Command yank &optional arg
1034 @cindex inserting killed text
1035 This command inserts before point the text at the front of the kill
1036 ring. It sets the mark at the beginning of that text, using
1037 @code{push-mark} (@pxref{The Mark}), and puts point at the end.
1038
1039 If @var{arg} is a non-@code{nil} list (which occurs interactively when
1040 the user types @kbd{C-u} with no digits), then @code{yank} inserts the
1041 text as described above, but puts point before the yanked text and
1042 sets the mark after it.
1043
1044 If @var{arg} is a number, then @code{yank} inserts the @var{arg}th
1045 most recently killed text---the @var{arg}th element of the kill ring
1046 list, counted cyclically from the front, which is considered the
1047 first element for this purpose.
1048
1049 @code{yank} does not alter the contents of the kill ring, unless it
1050 used text provided by another program, in which case it pushes that text
1051 onto the kill ring. However if @var{arg} is an integer different from
1052 one, it rotates the kill ring to place the yanked string at the front.
1053
1054 @code{yank} returns @code{nil}.
1055 @end deffn
1056
1057 @deffn Command yank-pop &optional arg
1058 This command replaces the just-yanked entry from the kill ring with a
1059 different entry from the kill ring.
1060
1061 This is allowed only immediately after a @code{yank} or another
1062 @code{yank-pop}. At such a time, the region contains text that was just
1063 inserted by yanking. @code{yank-pop} deletes that text and inserts in
1064 its place a different piece of killed text. It does not add the deleted
1065 text to the kill ring, since it is already in the kill ring somewhere.
1066 It does however rotate the kill ring to place the newly yanked string at
1067 the front.
1068
1069 If @var{arg} is @code{nil}, then the replacement text is the previous
1070 element of the kill ring. If @var{arg} is numeric, the replacement is
1071 the @var{arg}th previous kill. If @var{arg} is negative, a more recent
1072 kill is the replacement.
1073
1074 The sequence of kills in the kill ring wraps around, so that after the
1075 oldest one comes the newest one, and before the newest one goes the
1076 oldest.
1077
1078 The return value is always @code{nil}.
1079 @end deffn
1080
1081 @defvar yank-undo-function
1082 If this variable is non-@code{nil}, the function @code{yank-pop} uses
1083 its value instead of @code{delete-region} to delete the text
1084 inserted by the previous @code{yank} or
1085 @code{yank-pop} command. The value must be a function of two
1086 arguments, the start and end of the current region.
1087
1088 The function @code{insert-for-yank} automatically sets this variable
1089 according to the @var{undo} element of the @code{yank-handler}
1090 text property, if there is one.
1091 @end defvar
1092
1093 @node Low-Level Kill Ring
1094 @subsection Low-Level Kill Ring
1095
1096 These functions and variables provide access to the kill ring at a
1097 lower level, but are still convenient for use in Lisp programs,
1098 because they take care of interaction with window system selections
1099 (@pxref{Window System Selections}).
1100
1101 @defun current-kill n &optional do-not-move
1102 The function @code{current-kill} rotates the yanking pointer, which
1103 designates the ``front'' of the kill ring, by @var{n} places (from newer
1104 kills to older ones), and returns the text at that place in the ring.
1105
1106 If the optional second argument @var{do-not-move} is non-@code{nil},
1107 then @code{current-kill} doesn't alter the yanking pointer; it just
1108 returns the @var{n}th kill, counting from the current yanking pointer.
1109
1110 If @var{n} is zero, indicating a request for the latest kill,
1111 @code{current-kill} calls the value of
1112 @code{interprogram-paste-function} (documented below) before
1113 consulting the kill ring. If that value is a function and calling it
1114 returns a string or a list of several string, @code{current-kill}
1115 pushes the strings onto the kill ring and returns the first string.
1116 It also sets the yanking pointer to point to the kill-ring entry of
1117 the first string returned by @code{interprogram-paste-function},
1118 regardless of the value of @var{do-not-move}. Otherwise,
1119 @code{current-kill} does not treat a zero value for @var{n} specially:
1120 it returns the entry pointed at by the yanking pointer and does not
1121 move the yanking pointer.
1122 @end defun
1123
1124 @defun kill-new string &optional replace
1125 This function pushes the text @var{string} onto the kill ring and
1126 makes the yanking pointer point to it. It discards the oldest entry
1127 if appropriate. It also invokes the value of
1128 @code{interprogram-cut-function} (see below).
1129
1130 If @var{replace} is non-@code{nil}, then @code{kill-new} replaces the
1131 first element of the kill ring with @var{string}, rather than pushing
1132 @var{string} onto the kill ring.
1133 @end defun
1134
1135 @defun kill-append string before-p
1136 This function appends the text @var{string} to the first entry in the
1137 kill ring and makes the yanking pointer point to the combined entry.
1138 Normally @var{string} goes at the end of the entry, but if
1139 @var{before-p} is non-@code{nil}, it goes at the beginning. This
1140 function also invokes the value of @code{interprogram-cut-function}
1141 (see below).
1142 @end defun
1143
1144 @defvar interprogram-paste-function
1145 This variable provides a way of transferring killed text from other
1146 programs, when you are using a window system. Its value should be
1147 @code{nil} or a function of no arguments.
1148
1149 If the value is a function, @code{current-kill} calls it to get the
1150 ``most recent kill''. If the function returns a non-@code{nil} value,
1151 then that value is used as the ``most recent kill''. If it returns
1152 @code{nil}, then the front of the kill ring is used.
1153
1154 To facilitate support for window systems that support multiple
1155 selections, this function may also return a list of strings. In that
1156 case, the first string is used as the ``most recent kill'', and all
1157 the other strings are pushed onto the kill ring, for easy access by
1158 @code{yank-pop}.
1159
1160 The normal use of this function is to get the window system's
1161 clipboard as the most recent kill, even if the selection belongs to
1162 another application. @xref{Window System Selections}. However, if
1163 the clipboard contents come from the current Emacs session, this
1164 function should return @code{nil}.
1165 @end defvar
1166
1167 @defvar interprogram-cut-function
1168 This variable provides a way of communicating killed text to other
1169 programs, when you are using a window system. Its value should be
1170 @code{nil} or a function of one required argument.
1171
1172 If the value is a function, @code{kill-new} and @code{kill-append} call
1173 it with the new first element of the kill ring as the argument.
1174
1175 The normal use of this function is to put newly killed text in the
1176 window system's clipboard. @xref{Window System Selections}.
1177 @end defvar
1178
1179 @node Internals of Kill Ring
1180 @subsection Internals of the Kill Ring
1181
1182 The variable @code{kill-ring} holds the kill ring contents, in the
1183 form of a list of strings. The most recent kill is always at the front
1184 of the list.
1185
1186 The @code{kill-ring-yank-pointer} variable points to a link in the
1187 kill ring list, whose @sc{car} is the text to yank next. We say it
1188 identifies the ``front'' of the ring. Moving
1189 @code{kill-ring-yank-pointer} to a different link is called
1190 @dfn{rotating the kill ring}. We call the kill ring a ``ring'' because
1191 the functions that move the yank pointer wrap around from the end of the
1192 list to the beginning, or vice-versa. Rotation of the kill ring is
1193 virtual; it does not change the value of @code{kill-ring}.
1194
1195 Both @code{kill-ring} and @code{kill-ring-yank-pointer} are Lisp
1196 variables whose values are normally lists. The word ``pointer'' in the
1197 name of the @code{kill-ring-yank-pointer} indicates that the variable's
1198 purpose is to identify one element of the list for use by the next yank
1199 command.
1200
1201 The value of @code{kill-ring-yank-pointer} is always @code{eq} to one
1202 of the links in the kill ring list. The element it identifies is the
1203 @sc{car} of that link. Kill commands, which change the kill ring, also
1204 set this variable to the value of @code{kill-ring}. The effect is to
1205 rotate the ring so that the newly killed text is at the front.
1206
1207 Here is a diagram that shows the variable @code{kill-ring-yank-pointer}
1208 pointing to the second entry in the kill ring @code{("some text" "a
1209 different piece of text" "yet older text")}.
1210
1211 @example
1212 @group
1213 kill-ring ---- kill-ring-yank-pointer
1214 | |
1215 | v
1216 | --- --- --- --- --- ---
1217 --> | | |------> | | |--> | | |--> nil
1218 --- --- --- --- --- ---
1219 | | |
1220 | | |
1221 | | -->"yet older text"
1222 | |
1223 | --> "a different piece of text"
1224 |
1225 --> "some text"
1226 @end group
1227 @end example
1228
1229 @noindent
1230 This state of affairs might occur after @kbd{C-y} (@code{yank})
1231 immediately followed by @kbd{M-y} (@code{yank-pop}).
1232
1233 @defvar kill-ring
1234 This variable holds the list of killed text sequences, most recently
1235 killed first.
1236 @end defvar
1237
1238 @defvar kill-ring-yank-pointer
1239 This variable's value indicates which element of the kill ring is at the
1240 ``front'' of the ring for yanking. More precisely, the value is a tail
1241 of the value of @code{kill-ring}, and its @sc{car} is the kill string
1242 that @kbd{C-y} should yank.
1243 @end defvar
1244
1245 @defopt kill-ring-max
1246 The value of this variable is the maximum length to which the kill
1247 ring can grow, before elements are thrown away at the end. The default
1248 value for @code{kill-ring-max} is 60.
1249 @end defopt
1250
1251 @node Undo
1252 @section Undo
1253 @cindex redo
1254
1255 Most buffers have an @dfn{undo list}, which records all changes made
1256 to the buffer's text so that they can be undone. (The buffers that
1257 don't have one are usually special-purpose buffers for which Emacs
1258 assumes that undoing is not useful. In particular, any buffer whose
1259 name begins with a space has its undo recording off by default;
1260 see @ref{Buffer Names}.) All the primitives that modify the
1261 text in the buffer automatically add elements to the front of the undo
1262 list, which is in the variable @code{buffer-undo-list}.
1263
1264 @defvar buffer-undo-list
1265 This buffer-local variable's value is the undo list of the current
1266 buffer. A value of @code{t} disables the recording of undo information.
1267 @end defvar
1268
1269 Here are the kinds of elements an undo list can have:
1270
1271 @table @code
1272 @item @var{position}
1273 This kind of element records a previous value of point; undoing this
1274 element moves point to @var{position}. Ordinary cursor motion does not
1275 make any sort of undo record, but deletion operations use these entries
1276 to record where point was before the command.
1277
1278 @item (@var{beg} . @var{end})
1279 This kind of element indicates how to delete text that was inserted.
1280 Upon insertion, the text occupied the range @var{beg}--@var{end} in the
1281 buffer.
1282
1283 @item (@var{text} . @var{position})
1284 This kind of element indicates how to reinsert text that was deleted.
1285 The deleted text itself is the string @var{text}. The place to
1286 reinsert it is @code{(abs @var{position})}. If @var{position} is
1287 positive, point was at the beginning of the deleted text, otherwise it
1288 was at the end. Zero or more (@var{marker} . @var{adjustment})
1289 elements follow immediately after this element.
1290
1291 @item (t . @var{time-flag})
1292 This kind of element indicates that an unmodified buffer became
1293 modified. A @var{time-flag} of the form
1294 @code{(@var{sec-high} @var{sec-low} @var{microsec}
1295 @var{picosec})} represents the visited file's modification time as of
1296 when it was previously visited or saved, using the same format as
1297 @code{current-time}; see @ref{Time of Day}.
1298 A @var{time-flag} of 0 means the buffer does not correspond to any file;
1299 @minus{}1 means the visited file previously did not exist.
1300 @code{primitive-undo} uses these
1301 values to determine whether to mark the buffer as unmodified once again;
1302 it does so only if the file's status matches that of @var{time-flag}.
1303
1304 @item (nil @var{property} @var{value} @var{beg} . @var{end})
1305 This kind of element records a change in a text property.
1306 Here's how you might undo the change:
1307
1308 @example
1309 (put-text-property @var{beg} @var{end} @var{property} @var{value})
1310 @end example
1311
1312 @item (@var{marker} . @var{adjustment})
1313 This kind of element records the fact that the marker @var{marker} was
1314 relocated due to deletion of surrounding text, and that it moved
1315 @var{adjustment} character positions. If the marker's location is
1316 consistent with the (@var{text} . @var{position}) element preceding it
1317 in the undo list, then undoing this element moves @var{marker}
1318 @minus{} @var{adjustment} characters.
1319
1320 @item (apply @var{funname} . @var{args})
1321 This is an extensible undo item, which is undone by calling
1322 @var{funname} with arguments @var{args}.
1323
1324 @item (apply @var{delta} @var{beg} @var{end} @var{funname} . @var{args})
1325 This is an extensible undo item, which records a change limited to the
1326 range @var{beg} to @var{end}, which increased the size of the buffer
1327 by @var{delta} characters. It is undone by calling @var{funname} with
1328 arguments @var{args}.
1329
1330 This kind of element enables undo limited to a region to determine
1331 whether the element pertains to that region.
1332
1333 @item nil
1334 This element is a boundary. The elements between two boundaries are
1335 called a @dfn{change group}; normally, each change group corresponds to
1336 one keyboard command, and undo commands normally undo an entire group as
1337 a unit.
1338 @end table
1339
1340 @defun undo-boundary
1341 This function places a boundary element in the undo list. The undo
1342 command stops at such a boundary, and successive undo commands undo
1343 to earlier and earlier boundaries. This function returns @code{nil}.
1344
1345 The editor command loop automatically calls @code{undo-boundary} just
1346 before executing each key sequence, so that each undo normally undoes
1347 the effects of one command. As an exception, the command
1348 @code{self-insert-command}, which produces self-inserting input
1349 characters (@pxref{Commands for Insertion}), may remove the boundary
1350 inserted by the command loop: a boundary is accepted for the first
1351 such character, the next 19 consecutive self-inserting input
1352 characters do not have boundaries, and then the 20th does; and so on
1353 as long as the self-inserting characters continue. Hence, sequences
1354 of consecutive character insertions can be undone as a group.
1355
1356 All buffer modifications add a boundary whenever the previous undoable
1357 change was made in some other buffer. This is to ensure that
1358 each command makes a boundary in each buffer where it makes changes.
1359
1360 Calling this function explicitly is useful for splitting the effects of
1361 a command into more than one unit. For example, @code{query-replace}
1362 calls @code{undo-boundary} after each replacement, so that the user can
1363 undo individual replacements one by one.
1364 @end defun
1365
1366 @defvar undo-in-progress
1367 This variable is normally @code{nil}, but the undo commands bind it to
1368 @code{t}. This is so that various kinds of change hooks can tell when
1369 they're being called for the sake of undoing.
1370 @end defvar
1371
1372 @defun primitive-undo count list
1373 This is the basic function for undoing elements of an undo list.
1374 It undoes the first @var{count} elements of @var{list}, returning
1375 the rest of @var{list}.
1376
1377 @code{primitive-undo} adds elements to the buffer's undo list when it
1378 changes the buffer. Undo commands avoid confusion by saving the undo
1379 list value at the beginning of a sequence of undo operations. Then the
1380 undo operations use and update the saved value. The new elements added
1381 by undoing are not part of this saved value, so they don't interfere with
1382 continuing to undo.
1383
1384 This function does not bind @code{undo-in-progress}.
1385 @end defun
1386
1387 @node Maintaining Undo
1388 @section Maintaining Undo Lists
1389
1390 This section describes how to enable and disable undo information for
1391 a given buffer. It also explains how the undo list is truncated
1392 automatically so it doesn't get too big.
1393
1394 Recording of undo information in a newly created buffer is normally
1395 enabled to start with; but if the buffer name starts with a space, the
1396 undo recording is initially disabled. You can explicitly enable or
1397 disable undo recording with the following two functions, or by setting
1398 @code{buffer-undo-list} yourself.
1399
1400 @deffn Command buffer-enable-undo &optional buffer-or-name
1401 This command enables recording undo information for buffer
1402 @var{buffer-or-name}, so that subsequent changes can be undone. If no
1403 argument is supplied, then the current buffer is used. This function
1404 does nothing if undo recording is already enabled in the buffer. It
1405 returns @code{nil}.
1406
1407 In an interactive call, @var{buffer-or-name} is the current buffer.
1408 You cannot specify any other buffer.
1409 @end deffn
1410
1411 @deffn Command buffer-disable-undo &optional buffer-or-name
1412 @cindex disabling undo
1413 This function discards the undo list of @var{buffer-or-name}, and disables
1414 further recording of undo information. As a result, it is no longer
1415 possible to undo either previous changes or any subsequent changes. If
1416 the undo list of @var{buffer-or-name} is already disabled, this function
1417 has no effect.
1418
1419 In an interactive call, BUFFER-OR-NAME is the current buffer. You
1420 cannot specify any other buffer. This function returns @code{nil}.
1421 @end deffn
1422
1423 As editing continues, undo lists get longer and longer. To prevent
1424 them from using up all available memory space, garbage collection trims
1425 them back to size limits you can set. (For this purpose, the ``size''
1426 of an undo list measures the cons cells that make up the list, plus the
1427 strings of deleted text.) Three variables control the range of acceptable
1428 sizes: @code{undo-limit}, @code{undo-strong-limit} and
1429 @code{undo-outer-limit}. In these variables, size is counted as the
1430 number of bytes occupied, which includes both saved text and other
1431 data.
1432
1433 @defopt undo-limit
1434 This is the soft limit for the acceptable size of an undo list. The
1435 change group at which this size is exceeded is the last one kept.
1436 @end defopt
1437
1438 @defopt undo-strong-limit
1439 This is the upper limit for the acceptable size of an undo list. The
1440 change group at which this size is exceeded is discarded itself (along
1441 with all older change groups). There is one exception: the very latest
1442 change group is only discarded if it exceeds @code{undo-outer-limit}.
1443 @end defopt
1444
1445 @defopt undo-outer-limit
1446 If at garbage collection time the undo info for the current command
1447 exceeds this limit, Emacs discards the info and displays a warning.
1448 This is a last ditch limit to prevent memory overflow.
1449 @end defopt
1450
1451 @defopt undo-ask-before-discard
1452 If this variable is non-@code{nil}, when the undo info exceeds
1453 @code{undo-outer-limit}, Emacs asks in the echo area whether to
1454 discard the info. The default value is @code{nil}, which means to
1455 discard it automatically.
1456
1457 This option is mainly intended for debugging. Garbage collection is
1458 inhibited while the question is asked, which means that Emacs might
1459 leak memory if the user waits too long before answering the question.
1460 @end defopt
1461
1462 @node Filling
1463 @section Filling
1464 @cindex filling text
1465
1466 @dfn{Filling} means adjusting the lengths of lines (by moving the line
1467 breaks) so that they are nearly (but no greater than) a specified
1468 maximum width. Additionally, lines can be @dfn{justified}, which means
1469 inserting spaces to make the left and/or right margins line up
1470 precisely. The width is controlled by the variable @code{fill-column}.
1471 For ease of reading, lines should be no longer than 70 or so columns.
1472
1473 You can use Auto Fill mode (@pxref{Auto Filling}) to fill text
1474 automatically as you insert it, but changes to existing text may leave
1475 it improperly filled. Then you must fill the text explicitly.
1476
1477 Most of the commands in this section return values that are not
1478 meaningful. All the functions that do filling take note of the current
1479 left margin, current right margin, and current justification style
1480 (@pxref{Margins}). If the current justification style is
1481 @code{none}, the filling functions don't actually do anything.
1482
1483 Several of the filling functions have an argument @var{justify}.
1484 If it is non-@code{nil}, that requests some kind of justification. It
1485 can be @code{left}, @code{right}, @code{full}, or @code{center}, to
1486 request a specific style of justification. If it is @code{t}, that
1487 means to use the current justification style for this part of the text
1488 (see @code{current-justification}, below). Any other value is treated
1489 as @code{full}.
1490
1491 When you call the filling functions interactively, using a prefix
1492 argument implies the value @code{full} for @var{justify}.
1493
1494 @deffn Command fill-paragraph &optional justify region
1495 This command fills the paragraph at or after point. If
1496 @var{justify} is non-@code{nil}, each line is justified as well.
1497 It uses the ordinary paragraph motion commands to find paragraph
1498 boundaries. @xref{Paragraphs,,, emacs, The GNU Emacs Manual}.
1499
1500 When @var{region} is non-@code{nil}, then if Transient Mark mode is
1501 enabled and the mark is active, this command calls @code{fill-region}
1502 to fill all the paragraphs in the region, instead of filling only the
1503 current paragraph. When this command is called interactively,
1504 @var{region} is @code{t}.
1505 @end deffn
1506
1507 @deffn Command fill-region start end &optional justify nosqueeze to-eop
1508 This command fills each of the paragraphs in the region from @var{start}
1509 to @var{end}. It justifies as well if @var{justify} is
1510 non-@code{nil}.
1511
1512 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1513 other than line breaks untouched. If @var{to-eop} is non-@code{nil},
1514 that means to keep filling to the end of the paragraph---or the next hard
1515 newline, if @code{use-hard-newlines} is enabled (see below).
1516
1517 The variable @code{paragraph-separate} controls how to distinguish
1518 paragraphs. @xref{Standard Regexps}.
1519 @end deffn
1520
1521 @deffn Command fill-individual-paragraphs start end &optional justify citation-regexp
1522 This command fills each paragraph in the region according to its
1523 individual fill prefix. Thus, if the lines of a paragraph were indented
1524 with spaces, the filled paragraph will remain indented in the same
1525 fashion.
1526
1527 The first two arguments, @var{start} and @var{end}, are the beginning
1528 and end of the region to be filled. The third and fourth arguments,
1529 @var{justify} and @var{citation-regexp}, are optional. If
1530 @var{justify} is non-@code{nil}, the paragraphs are justified as
1531 well as filled. If @var{citation-regexp} is non-@code{nil}, it means the
1532 function is operating on a mail message and therefore should not fill
1533 the header lines. If @var{citation-regexp} is a string, it is used as
1534 a regular expression; if it matches the beginning of a line, that line
1535 is treated as a citation marker.
1536
1537 @c FIXME: "That mode" is confusing. It isn't a major/minor mode.
1538 Ordinarily, @code{fill-individual-paragraphs} regards each change in
1539 indentation as starting a new paragraph. If
1540 @code{fill-individual-varying-indent} is non-@code{nil}, then only
1541 separator lines separate paragraphs. That mode can handle indented
1542 paragraphs with additional indentation on the first line.
1543 @end deffn
1544
1545 @defopt fill-individual-varying-indent
1546 This variable alters the action of @code{fill-individual-paragraphs} as
1547 described above.
1548 @end defopt
1549
1550 @deffn Command fill-region-as-paragraph start end &optional justify nosqueeze squeeze-after
1551 This command considers a region of text as a single paragraph and fills
1552 it. If the region was made up of many paragraphs, the blank lines
1553 between paragraphs are removed. This function justifies as well as
1554 filling when @var{justify} is non-@code{nil}.
1555
1556 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1557 other than line breaks untouched. If @var{squeeze-after} is
1558 non-@code{nil}, it specifies a position in the region, and means don't
1559 canonicalize spaces before that position.
1560
1561 In Adaptive Fill mode, this command calls @code{fill-context-prefix} to
1562 choose a fill prefix by default. @xref{Adaptive Fill}.
1563 @end deffn
1564
1565 @deffn Command justify-current-line &optional how eop nosqueeze
1566 This command inserts spaces between the words of the current line so
1567 that the line ends exactly at @code{fill-column}. It returns
1568 @code{nil}.
1569
1570 The argument @var{how}, if non-@code{nil} specifies explicitly the style
1571 of justification. It can be @code{left}, @code{right}, @code{full},
1572 @code{center}, or @code{none}. If it is @code{t}, that means to do
1573 follow specified justification style (see @code{current-justification},
1574 below). @code{nil} means to do full justification.
1575
1576 If @var{eop} is non-@code{nil}, that means do only left-justification
1577 if @code{current-justification} specifies full justification. This is
1578 used for the last line of a paragraph; even if the paragraph as a
1579 whole is fully justified, the last line should not be.
1580
1581 If @var{nosqueeze} is non-@code{nil}, that means do not change interior
1582 whitespace.
1583 @end deffn
1584
1585 @defopt default-justification
1586 This variable's value specifies the style of justification to use for
1587 text that doesn't specify a style with a text property. The possible
1588 values are @code{left}, @code{right}, @code{full}, @code{center}, or
1589 @code{none}. The default value is @code{left}.
1590 @end defopt
1591
1592 @defun current-justification
1593 This function returns the proper justification style to use for filling
1594 the text around point.
1595
1596 This returns the value of the @code{justification} text property at
1597 point, or the variable @var{default-justification} if there is no such
1598 text property. However, it returns @code{nil} rather than @code{none}
1599 to mean ``don't justify''.
1600 @end defun
1601
1602 @defopt sentence-end-double-space
1603 @anchor{Definition of sentence-end-double-space}
1604 If this variable is non-@code{nil}, a period followed by just one space
1605 does not count as the end of a sentence, and the filling functions
1606 avoid breaking the line at such a place.
1607 @end defopt
1608
1609 @defopt sentence-end-without-period
1610 If this variable is non-@code{nil}, a sentence can end without a
1611 period. This is used for languages like Thai, where sentences end
1612 with a double space but without a period.
1613 @end defopt
1614
1615 @defopt sentence-end-without-space
1616 If this variable is non-@code{nil}, it should be a string of
1617 characters that can end a sentence without following spaces.
1618 @end defopt
1619
1620 @defvar fill-paragraph-function
1621 This variable provides a way to override the filling of paragraphs.
1622 If its value is non-@code{nil}, @code{fill-paragraph} calls this
1623 function to do the work. If the function returns a non-@code{nil}
1624 value, @code{fill-paragraph} assumes the job is done, and immediately
1625 returns that value.
1626
1627 The usual use of this feature is to fill comments in programming
1628 language modes. If the function needs to fill a paragraph in the usual
1629 way, it can do so as follows:
1630
1631 @example
1632 (let ((fill-paragraph-function nil))
1633 (fill-paragraph arg))
1634 @end example
1635 @end defvar
1636
1637 @defvar fill-forward-paragraph-function
1638 This variable provides a way to override how the filling functions,
1639 such as @code{fill-region} and @code{fill-paragraph}, move forward to
1640 the next paragraph. Its value should be a function, which is called
1641 with a single argument @var{n}, the number of paragraphs to move, and
1642 should return the difference between @var{n} and the number of
1643 paragraphs actually moved. The default value of this variable is
1644 @code{forward-paragraph}. @xref{Paragraphs,,, emacs, The GNU Emacs
1645 Manual}.
1646 @end defvar
1647
1648 @defvar use-hard-newlines
1649 If this variable is non-@code{nil}, the filling functions do not delete
1650 newlines that have the @code{hard} text property. These ``hard
1651 newlines'' act as paragraph separators. @xref{Hard and Soft
1652 Newlines,, Hard and Soft Newlines, emacs, The GNU Emacs Manual}.
1653 @end defvar
1654
1655 @node Margins
1656 @section Margins for Filling
1657 @cindex margins, filling
1658
1659 @defopt fill-prefix
1660 This buffer-local variable, if non-@code{nil}, specifies a string of
1661 text that appears at the beginning of normal text lines and should be
1662 disregarded when filling them. Any line that fails to start with the
1663 fill prefix is considered the start of a paragraph; so is any line
1664 that starts with the fill prefix followed by additional whitespace.
1665 Lines that start with the fill prefix but no additional whitespace are
1666 ordinary text lines that can be filled together. The resulting filled
1667 lines also start with the fill prefix.
1668
1669 The fill prefix follows the left margin whitespace, if any.
1670 @end defopt
1671
1672 @defopt fill-column
1673 This buffer-local variable specifies the maximum width of filled lines.
1674 Its value should be an integer, which is a number of columns. All the
1675 filling, justification, and centering commands are affected by this
1676 variable, including Auto Fill mode (@pxref{Auto Filling}).
1677
1678 As a practical matter, if you are writing text for other people to
1679 read, you should set @code{fill-column} to no more than 70. Otherwise
1680 the line will be too long for people to read comfortably, and this can
1681 make the text seem clumsy.
1682
1683 The default value for @code{fill-column} is 70.
1684 @end defopt
1685
1686 @deffn Command set-left-margin from to margin
1687 This sets the @code{left-margin} property on the text from @var{from} to
1688 @var{to} to the value @var{margin}. If Auto Fill mode is enabled, this
1689 command also refills the region to fit the new margin.
1690 @end deffn
1691
1692 @deffn Command set-right-margin from to margin
1693 This sets the @code{right-margin} property on the text from @var{from}
1694 to @var{to} to the value @var{margin}. If Auto Fill mode is enabled,
1695 this command also refills the region to fit the new margin.
1696 @end deffn
1697
1698 @defun current-left-margin
1699 This function returns the proper left margin value to use for filling
1700 the text around point. The value is the sum of the @code{left-margin}
1701 property of the character at the start of the current line (or zero if
1702 none), and the value of the variable @code{left-margin}.
1703 @end defun
1704
1705 @defun current-fill-column
1706 This function returns the proper fill column value to use for filling
1707 the text around point. The value is the value of the @code{fill-column}
1708 variable, minus the value of the @code{right-margin} property of the
1709 character after point.
1710 @end defun
1711
1712 @deffn Command move-to-left-margin &optional n force
1713 This function moves point to the left margin of the current line. The
1714 column moved to is determined by calling the function
1715 @code{current-left-margin}. If the argument @var{n} is non-@code{nil},
1716 @code{move-to-left-margin} moves forward @var{n}@minus{}1 lines first.
1717
1718 If @var{force} is non-@code{nil}, that says to fix the line's
1719 indentation if that doesn't match the left margin value.
1720 @end deffn
1721
1722 @defun delete-to-left-margin &optional from to
1723 This function removes left margin indentation from the text between
1724 @var{from} and @var{to}. The amount of indentation to delete is
1725 determined by calling @code{current-left-margin}. In no case does this
1726 function delete non-whitespace. If @var{from} and @var{to} are omitted,
1727 they default to the whole buffer.
1728 @end defun
1729
1730 @defun indent-to-left-margin
1731 This function adjusts the indentation at the beginning of the current
1732 line to the value specified by the variable @code{left-margin}. (That
1733 may involve either inserting or deleting whitespace.) This function
1734 is value of @code{indent-line-function} in Paragraph-Indent Text mode.
1735 @end defun
1736
1737 @defopt left-margin
1738 This variable specifies the base left margin column. In Fundamental
1739 mode, @kbd{RET} indents to this column. This variable automatically
1740 becomes buffer-local when set in any fashion.
1741 @end defopt
1742
1743 @defopt fill-nobreak-predicate
1744 This variable gives major modes a way to specify not to break a line
1745 at certain places. Its value should be a list of functions. Whenever
1746 filling considers breaking the line at a certain place in the buffer,
1747 it calls each of these functions with no arguments and with point
1748 located at that place. If any of the functions returns
1749 non-@code{nil}, then the line won't be broken there.
1750 @end defopt
1751
1752 @node Adaptive Fill
1753 @section Adaptive Fill Mode
1754 @c @cindex Adaptive Fill mode "adaptive-fill-mode" is adjacent.
1755
1756 When @dfn{Adaptive Fill Mode} is enabled, Emacs determines the fill
1757 prefix automatically from the text in each paragraph being filled
1758 rather than using a predetermined value. During filling, this fill
1759 prefix gets inserted at the start of the second and subsequent lines
1760 of the paragraph as described in @ref{Filling}, and in @ref{Auto
1761 Filling}.
1762
1763 @defopt adaptive-fill-mode
1764 Adaptive Fill mode is enabled when this variable is non-@code{nil}.
1765 It is @code{t} by default.
1766 @end defopt
1767
1768 @defun fill-context-prefix from to
1769 This function implements the heart of Adaptive Fill mode; it chooses a
1770 fill prefix based on the text between @var{from} and @var{to},
1771 typically the start and end of a paragraph. It does this by looking
1772 at the first two lines of the paragraph, based on the variables
1773 described below.
1774 @c The optional argument first-line-regexp is not documented
1775 @c because it exists for internal purposes and might be eliminated
1776 @c in the future.
1777
1778 Usually, this function returns the fill prefix, a string. However,
1779 before doing this, the function makes a final check (not specially
1780 mentioned in the following) that a line starting with this prefix
1781 wouldn't look like the start of a paragraph. Should this happen, the
1782 function signals the anomaly by returning @code{nil} instead.
1783
1784 In detail, @code{fill-context-prefix} does this:
1785
1786 @enumerate
1787 @item
1788 It takes a candidate for the fill prefix from the first line---it
1789 tries first the function in @code{adaptive-fill-function} (if any),
1790 then the regular expression @code{adaptive-fill-regexp} (see below).
1791 The first non-@code{nil} result of these, or the empty string if
1792 they're both @code{nil}, becomes the first line's candidate.
1793 @item
1794 If the paragraph has as yet only one line, the function tests the
1795 validity of the prefix candidate just found. The function then
1796 returns the candidate if it's valid, or a string of spaces otherwise.
1797 (see the description of @code{adaptive-fill-first-line-regexp} below).
1798 @item
1799 When the paragraph already has two lines, the function next looks for
1800 a prefix candidate on the second line, in just the same way it did for
1801 the first line. If it doesn't find one, it returns @code{nil}.
1802 @item
1803 The function now compares the two candidate prefixes heuristically: if
1804 the non-whitespace characters in the line 2 candidate occur in the
1805 same order in the line 1 candidate, the function returns the line 2
1806 candidate. Otherwise, it returns the largest initial substring which
1807 is common to both candidates (which might be the empty string).
1808 @end enumerate
1809 @end defun
1810
1811 @defopt adaptive-fill-regexp
1812 Adaptive Fill mode matches this regular expression against the text
1813 starting after the left margin whitespace (if any) on a line; the
1814 characters it matches are that line's candidate for the fill prefix.
1815
1816 The default value matches whitespace with certain punctuation
1817 characters intermingled.
1818 @end defopt
1819
1820 @defopt adaptive-fill-first-line-regexp
1821 Used only in one-line paragraphs, this regular expression acts as an
1822 additional check of the validity of the one available candidate fill
1823 prefix: the candidate must match this regular expression, or match
1824 @code{comment-start-skip}. If it doesn't, @code{fill-context-prefix}
1825 replaces the candidate with a string of spaces ``of the same width''
1826 as it.
1827
1828 The default value of this variable is @w{@code{"\\`[ \t]*\\'"}}, which
1829 matches only a string of whitespace. The effect of this default is to
1830 force the fill prefixes found in one-line paragraphs always to be pure
1831 whitespace.
1832 @end defopt
1833
1834 @defopt adaptive-fill-function
1835 You can specify more complex ways of choosing a fill prefix
1836 automatically by setting this variable to a function. The function is
1837 called with point after the left margin (if any) of a line, and it
1838 must preserve point. It should return either ``that line's'' fill
1839 prefix or @code{nil}, meaning it has failed to determine a prefix.
1840 @end defopt
1841
1842 @node Auto Filling
1843 @section Auto Filling
1844 @cindex filling, automatic
1845 @cindex Auto Fill mode
1846
1847 @c FIXME: I don't think any of the variables below is a/an normal/abnormal hook.
1848 Auto Fill mode is a minor mode that fills lines automatically as text
1849 is inserted. This section describes the hook used by Auto Fill mode.
1850 For a description of functions that you can call explicitly to fill and
1851 justify existing text, see @ref{Filling}.
1852
1853 Auto Fill mode also enables the functions that change the margins and
1854 justification style to refill portions of the text. @xref{Margins}.
1855
1856 @defvar auto-fill-function
1857 The value of this buffer-local variable should be a function (of no
1858 arguments) to be called after self-inserting a character from the table
1859 @code{auto-fill-chars}. It may be @code{nil}, in which case nothing
1860 special is done in that case.
1861
1862 The value of @code{auto-fill-function} is @code{do-auto-fill} when
1863 Auto-Fill mode is enabled. That is a function whose sole purpose is to
1864 implement the usual strategy for breaking a line.
1865 @end defvar
1866
1867 @defvar normal-auto-fill-function
1868 This variable specifies the function to use for
1869 @code{auto-fill-function}, if and when Auto Fill is turned on. Major
1870 modes can set buffer-local values for this variable to alter how Auto
1871 Fill works.
1872 @end defvar
1873
1874 @defvar auto-fill-chars
1875 A char table of characters which invoke @code{auto-fill-function} when
1876 self-inserted---space and newline in most language environments. They
1877 have an entry @code{t} in the table.
1878 @end defvar
1879
1880 @node Sorting
1881 @section Sorting Text
1882 @cindex sorting text
1883
1884 The sorting functions described in this section all rearrange text in
1885 a buffer. This is in contrast to the function @code{sort}, which
1886 rearranges the order of the elements of a list (@pxref{Rearrangement}).
1887 The values returned by these functions are not meaningful.
1888
1889 @defun sort-subr reverse nextrecfun endrecfun &optional startkeyfun endkeyfun predicate
1890 This function is the general text-sorting routine that subdivides a
1891 buffer into records and then sorts them. Most of the commands in this
1892 section use this function.
1893
1894 To understand how @code{sort-subr} works, consider the whole accessible
1895 portion of the buffer as being divided into disjoint pieces called
1896 @dfn{sort records}. The records may or may not be contiguous, but they
1897 must not overlap. A portion of each sort record (perhaps all of it) is
1898 designated as the sort key. Sorting rearranges the records in order by
1899 their sort keys.
1900
1901 Usually, the records are rearranged in order of ascending sort key.
1902 If the first argument to the @code{sort-subr} function, @var{reverse},
1903 is non-@code{nil}, the sort records are rearranged in order of
1904 descending sort key.
1905
1906 The next four arguments to @code{sort-subr} are functions that are
1907 called to move point across a sort record. They are called many times
1908 from within @code{sort-subr}.
1909
1910 @enumerate
1911 @item
1912 @var{nextrecfun} is called with point at the end of a record. This
1913 function moves point to the start of the next record. The first record
1914 is assumed to start at the position of point when @code{sort-subr} is
1915 called. Therefore, you should usually move point to the beginning of
1916 the buffer before calling @code{sort-subr}.
1917
1918 This function can indicate there are no more sort records by leaving
1919 point at the end of the buffer.
1920
1921 @item
1922 @var{endrecfun} is called with point within a record. It moves point to
1923 the end of the record.
1924
1925 @item
1926 @var{startkeyfun} is called to move point from the start of a record to
1927 the start of the sort key. This argument is optional; if it is omitted,
1928 the whole record is the sort key. If supplied, the function should
1929 either return a non-@code{nil} value to be used as the sort key, or
1930 return @code{nil} to indicate that the sort key is in the buffer
1931 starting at point. In the latter case, @var{endkeyfun} is called to
1932 find the end of the sort key.
1933
1934 @item
1935 @var{endkeyfun} is called to move point from the start of the sort key
1936 to the end of the sort key. This argument is optional. If
1937 @var{startkeyfun} returns @code{nil} and this argument is omitted (or
1938 @code{nil}), then the sort key extends to the end of the record. There
1939 is no need for @var{endkeyfun} if @var{startkeyfun} returns a
1940 non-@code{nil} value.
1941 @end enumerate
1942
1943 The argument @var{predicate} is the function to use to compare keys.
1944 If keys are numbers, it defaults to @code{<}; otherwise it defaults to
1945 @code{string<}.
1946
1947 As an example of @code{sort-subr}, here is the complete function
1948 definition for @code{sort-lines}:
1949
1950 @example
1951 @group
1952 ;; @r{Note that the first two lines of doc string}
1953 ;; @r{are effectively one line when viewed by a user.}
1954 (defun sort-lines (reverse beg end)
1955 "Sort lines in region alphabetically;\
1956 argument means descending order.
1957 Called from a program, there are three arguments:
1958 @end group
1959 @group
1960 REVERSE (non-nil means reverse order),\
1961 BEG and END (region to sort).
1962 The variable `sort-fold-case' determines\
1963 whether alphabetic case affects
1964 the sort order."
1965 @end group
1966 @group
1967 (interactive "P\nr")
1968 (save-excursion
1969 (save-restriction
1970 (narrow-to-region beg end)
1971 (goto-char (point-min))
1972 (let ((inhibit-field-text-motion t))
1973 (sort-subr reverse 'forward-line 'end-of-line)))))
1974 @end group
1975 @end example
1976
1977 Here @code{forward-line} moves point to the start of the next record,
1978 and @code{end-of-line} moves point to the end of record. We do not pass
1979 the arguments @var{startkeyfun} and @var{endkeyfun}, because the entire
1980 record is used as the sort key.
1981
1982 The @code{sort-paragraphs} function is very much the same, except that
1983 its @code{sort-subr} call looks like this:
1984
1985 @example
1986 @group
1987 (sort-subr reverse
1988 (function
1989 (lambda ()
1990 (while (and (not (eobp))
1991 (looking-at paragraph-separate))
1992 (forward-line 1))))
1993 'forward-paragraph)
1994 @end group
1995 @end example
1996
1997 Markers pointing into any sort records are left with no useful
1998 position after @code{sort-subr} returns.
1999 @end defun
2000
2001 @defopt sort-fold-case
2002 If this variable is non-@code{nil}, @code{sort-subr} and the other
2003 buffer sorting functions ignore case when comparing strings.
2004 @end defopt
2005
2006 @deffn Command sort-regexp-fields reverse record-regexp key-regexp start end
2007 This command sorts the region between @var{start} and @var{end}
2008 alphabetically as specified by @var{record-regexp} and @var{key-regexp}.
2009 If @var{reverse} is a negative integer, then sorting is in reverse
2010 order.
2011
2012 Alphabetical sorting means that two sort keys are compared by
2013 comparing the first characters of each, the second characters of each,
2014 and so on. If a mismatch is found, it means that the sort keys are
2015 unequal; the sort key whose character is less at the point of first
2016 mismatch is the lesser sort key. The individual characters are compared
2017 according to their numerical character codes in the Emacs character set.
2018
2019 The value of the @var{record-regexp} argument specifies how to divide
2020 the buffer into sort records. At the end of each record, a search is
2021 done for this regular expression, and the text that matches it is taken
2022 as the next record. For example, the regular expression @samp{^.+$},
2023 which matches lines with at least one character besides a newline, would
2024 make each such line into a sort record. @xref{Regular Expressions}, for
2025 a description of the syntax and meaning of regular expressions.
2026
2027 The value of the @var{key-regexp} argument specifies what part of each
2028 record is the sort key. The @var{key-regexp} could match the whole
2029 record, or only a part. In the latter case, the rest of the record has
2030 no effect on the sorted order of records, but it is carried along when
2031 the record moves to its new position.
2032
2033 The @var{key-regexp} argument can refer to the text matched by a
2034 subexpression of @var{record-regexp}, or it can be a regular expression
2035 on its own.
2036
2037 If @var{key-regexp} is:
2038
2039 @table @asis
2040 @item @samp{\@var{digit}}
2041 then the text matched by the @var{digit}th @samp{\(...\)} parenthesis
2042 grouping in @var{record-regexp} is the sort key.
2043
2044 @item @samp{\&}
2045 then the whole record is the sort key.
2046
2047 @item a regular expression
2048 then @code{sort-regexp-fields} searches for a match for the regular
2049 expression within the record. If such a match is found, it is the sort
2050 key. If there is no match for @var{key-regexp} within a record then
2051 that record is ignored, which means its position in the buffer is not
2052 changed. (The other records may move around it.)
2053 @end table
2054
2055 For example, if you plan to sort all the lines in the region by the
2056 first word on each line starting with the letter @samp{f}, you should
2057 set @var{record-regexp} to @samp{^.*$} and set @var{key-regexp} to
2058 @samp{\<f\w*\>}. The resulting expression looks like this:
2059
2060 @example
2061 @group
2062 (sort-regexp-fields nil "^.*$" "\\<f\\w*\\>"
2063 (region-beginning)
2064 (region-end))
2065 @end group
2066 @end example
2067
2068 If you call @code{sort-regexp-fields} interactively, it prompts for
2069 @var{record-regexp} and @var{key-regexp} in the minibuffer.
2070 @end deffn
2071
2072 @deffn Command sort-lines reverse start end
2073 This command alphabetically sorts lines in the region between
2074 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2075 is in reverse order.
2076 @end deffn
2077
2078 @deffn Command sort-paragraphs reverse start end
2079 This command alphabetically sorts paragraphs in the region between
2080 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2081 is in reverse order.
2082 @end deffn
2083
2084 @deffn Command sort-pages reverse start end
2085 This command alphabetically sorts pages in the region between
2086 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2087 is in reverse order.
2088 @end deffn
2089
2090 @deffn Command sort-fields field start end
2091 This command sorts lines in the region between @var{start} and
2092 @var{end}, comparing them alphabetically by the @var{field}th field
2093 of each line. Fields are separated by whitespace and numbered starting
2094 from 1. If @var{field} is negative, sorting is by the
2095 @w{@minus{}@var{field}th} field from the end of the line. This command
2096 is useful for sorting tables.
2097 @end deffn
2098
2099 @deffn Command sort-numeric-fields field start end
2100 This command sorts lines in the region between @var{start} and
2101 @var{end}, comparing them numerically by the @var{field}th field of
2102 each line. Fields are separated by whitespace and numbered starting
2103 from 1. The specified field must contain a number in each line of the
2104 region. Numbers starting with 0 are treated as octal, and numbers
2105 starting with @samp{0x} are treated as hexadecimal.
2106
2107 If @var{field} is negative, sorting is by the
2108 @w{@minus{}@var{field}th} field from the end of the line. This
2109 command is useful for sorting tables.
2110 @end deffn
2111
2112 @defopt sort-numeric-base
2113 This variable specifies the default radix for
2114 @code{sort-numeric-fields} to parse numbers.
2115 @end defopt
2116
2117 @deffn Command sort-columns reverse &optional beg end
2118 This command sorts the lines in the region between @var{beg} and
2119 @var{end}, comparing them alphabetically by a certain range of
2120 columns. The column positions of @var{beg} and @var{end} bound the
2121 range of columns to sort on.
2122
2123 If @var{reverse} is non-@code{nil}, the sort is in reverse order.
2124
2125 One unusual thing about this command is that the entire line
2126 containing position @var{beg}, and the entire line containing position
2127 @var{end}, are included in the region sorted.
2128
2129 Note that @code{sort-columns} rejects text that contains tabs, because
2130 tabs could be split across the specified columns. Use @kbd{M-x
2131 untabify} to convert tabs to spaces before sorting.
2132
2133 When possible, this command actually works by calling the @code{sort}
2134 utility program.
2135 @end deffn
2136
2137 @node Columns
2138 @section Counting Columns
2139 @cindex columns
2140 @cindex counting columns
2141 @cindex horizontal position
2142
2143 The column functions convert between a character position (counting
2144 characters from the beginning of the buffer) and a column position
2145 (counting screen characters from the beginning of a line).
2146
2147 These functions count each character according to the number of
2148 columns it occupies on the screen. This means control characters count
2149 as occupying 2 or 4 columns, depending upon the value of
2150 @code{ctl-arrow}, and tabs count as occupying a number of columns that
2151 depends on the value of @code{tab-width} and on the column where the tab
2152 begins. @xref{Usual Display}.
2153
2154 Column number computations ignore the width of the window and the
2155 amount of horizontal scrolling. Consequently, a column value can be
2156 arbitrarily high. The first (or leftmost) column is numbered 0. They
2157 also ignore overlays and text properties, aside from invisibility.
2158
2159 @defun current-column
2160 This function returns the horizontal position of point, measured in
2161 columns, counting from 0 at the left margin. The column position is the
2162 sum of the widths of all the displayed representations of the characters
2163 between the start of the current line and point.
2164
2165 For an example of using @code{current-column}, see the description of
2166 @code{count-lines} in @ref{Text Lines}.
2167 @end defun
2168
2169 @deffn Command move-to-column column &optional force
2170 This function moves point to @var{column} in the current line. The
2171 calculation of @var{column} takes into account the widths of the
2172 displayed representations of the characters between the start of the
2173 line and point.
2174
2175 When called interactively, @var{column} is the value of prefix numeric
2176 argument. If @var{column} is not an integer, an error is signaled.
2177
2178 @c This behavior used to be documented until 2013/08.
2179 @ignore
2180 If column @var{column} is beyond the end of the line, point moves to
2181 the end of the line. If @var{column} is negative, point moves to the
2182 beginning of the line.
2183 @end ignore
2184
2185 If it is impossible to move to column @var{column} because that is in
2186 the middle of a multicolumn character such as a tab, point moves to the
2187 end of that character. However, if @var{force} is non-@code{nil}, and
2188 @var{column} is in the middle of a tab, then @code{move-to-column}
2189 converts the tab into spaces so that it can move precisely to column
2190 @var{column}. Other multicolumn characters can cause anomalies despite
2191 @var{force}, since there is no way to split them.
2192
2193 The argument @var{force} also has an effect if the line isn't long
2194 enough to reach column @var{column}; if it is @code{t}, that means to
2195 add whitespace at the end of the line to reach that column.
2196
2197 The return value is the column number actually moved to.
2198 @end deffn
2199
2200 @node Indentation
2201 @section Indentation
2202 @cindex indentation
2203
2204 The indentation functions are used to examine, move to, and change
2205 whitespace that is at the beginning of a line. Some of the functions
2206 can also change whitespace elsewhere on a line. Columns and indentation
2207 count from zero at the left margin.
2208
2209 @menu
2210 * Primitive Indent:: Functions used to count and insert indentation.
2211 * Mode-Specific Indent:: Customize indentation for different modes.
2212 * Region Indent:: Indent all the lines in a region.
2213 * Relative Indent:: Indent the current line based on previous lines.
2214 * Indent Tabs:: Adjustable, typewriter-like tab stops.
2215 * Motion by Indent:: Move to first non-blank character.
2216 @end menu
2217
2218 @node Primitive Indent
2219 @subsection Indentation Primitives
2220
2221 This section describes the primitive functions used to count and
2222 insert indentation. The functions in the following sections use these
2223 primitives. @xref{Size of Displayed Text}, for related functions.
2224
2225 @defun current-indentation
2226 @comment !!Type Primitive Function
2227 @comment !!SourceFile indent.c
2228 This function returns the indentation of the current line, which is
2229 the horizontal position of the first nonblank character. If the
2230 contents are entirely blank, then this is the horizontal position of the
2231 end of the line.
2232 @end defun
2233
2234 @deffn Command indent-to column &optional minimum
2235 @comment !!Type Primitive Function
2236 @comment !!SourceFile indent.c
2237 This function indents from point with tabs and spaces until @var{column}
2238 is reached. If @var{minimum} is specified and non-@code{nil}, then at
2239 least that many spaces are inserted even if this requires going beyond
2240 @var{column}. Otherwise the function does nothing if point is already
2241 beyond @var{column}. The value is the column at which the inserted
2242 indentation ends.
2243
2244 The inserted whitespace characters inherit text properties from the
2245 surrounding text (usually, from the preceding text only). @xref{Sticky
2246 Properties}.
2247 @end deffn
2248
2249 @defopt indent-tabs-mode
2250 @comment !!SourceFile indent.c
2251 If this variable is non-@code{nil}, indentation functions can insert
2252 tabs as well as spaces. Otherwise, they insert only spaces. Setting
2253 this variable automatically makes it buffer-local in the current buffer.
2254 @end defopt
2255
2256 @node Mode-Specific Indent
2257 @subsection Indentation Controlled by Major Mode
2258
2259 An important function of each major mode is to customize the @key{TAB}
2260 key to indent properly for the language being edited. This section
2261 describes the mechanism of the @key{TAB} key and how to control it.
2262 The functions in this section return unpredictable values.
2263
2264 @deffn Command indent-for-tab-command &optional rigid
2265 This is the command bound to @key{TAB} in most editing modes. Its
2266 usual action is to indent the current line, but it can alternatively
2267 insert a tab character or indent a region.
2268
2269 Here is what it does:
2270
2271 @itemize
2272 @item
2273 First, it checks whether Transient Mark mode is enabled and the region
2274 is active. If so, it called @code{indent-region} to indent all the
2275 text in the region (@pxref{Region Indent}).
2276
2277 @item
2278 Otherwise, if the indentation function in @code{indent-line-function}
2279 is @code{indent-to-left-margin} (a trivial command that inserts a tab
2280 character), or if the variable @code{tab-always-indent} specifies that
2281 a tab character ought to be inserted (see below), then it inserts a
2282 tab character.
2283
2284 @item
2285 Otherwise, it indents the current line; this is done by calling the
2286 function in @code{indent-line-function}. If the line is already
2287 indented, and the value of @code{tab-always-indent} is @code{complete}
2288 (see below), it tries completing the text at point.
2289 @end itemize
2290
2291 If @var{rigid} is non-@code{nil} (interactively, with a prefix
2292 argument), then after this command indents a line or inserts a tab, it
2293 also rigidly indents the entire balanced expression which starts at
2294 the beginning of the current line, in order to reflect the new
2295 indentation. This argument is ignored if the command indents the
2296 region.
2297 @end deffn
2298
2299 @defvar indent-line-function
2300 This variable's value is the function to be used by
2301 @code{indent-for-tab-command}, and various other indentation commands,
2302 to indent the current line. It is usually assigned by the major mode;
2303 for instance, Lisp mode sets it to @code{lisp-indent-line}, C mode
2304 sets it to @code{c-indent-line}, and so on. The default value is
2305 @code{indent-relative}. @xref{Auto-Indentation}.
2306 @end defvar
2307
2308 @deffn Command indent-according-to-mode
2309 This command calls the function in @code{indent-line-function} to
2310 indent the current line in a way appropriate for the current major mode.
2311 @end deffn
2312
2313 @deffn Command newline-and-indent
2314 This function inserts a newline, then indents the new line (the one
2315 following the newline just inserted) according to the major mode. It
2316 does indentation by calling @code{indent-according-to-mode}.
2317 @end deffn
2318
2319 @deffn Command reindent-then-newline-and-indent
2320 This command reindents the current line, inserts a newline at point,
2321 and then indents the new line (the one following the newline just
2322 inserted). It does indentation on both lines by calling
2323 @code{indent-according-to-mode}.
2324 @end deffn
2325
2326 @defopt tab-always-indent
2327 This variable can be used to customize the behavior of the @key{TAB}
2328 (@code{indent-for-tab-command}) command. If the value is @code{t}
2329 (the default), the command normally just indents the current line. If
2330 the value is @code{nil}, the command indents the current line only if
2331 point is at the left margin or in the line's indentation; otherwise,
2332 it inserts a tab character. If the value is @code{complete}, the
2333 command first tries to indent the current line, and if the line was
2334 already indented, it calls @code{completion-at-point} to complete the
2335 text at point (@pxref{Completion in Buffers}).
2336 @end defopt
2337
2338 @node Region Indent
2339 @subsection Indenting an Entire Region
2340
2341 This section describes commands that indent all the lines in the
2342 region. They return unpredictable values.
2343
2344 @deffn Command indent-region start end &optional to-column
2345 This command indents each nonblank line starting between @var{start}
2346 (inclusive) and @var{end} (exclusive). If @var{to-column} is
2347 @code{nil}, @code{indent-region} indents each nonblank line by calling
2348 the current mode's indentation function, the value of
2349 @code{indent-line-function}.
2350
2351 If @var{to-column} is non-@code{nil}, it should be an integer
2352 specifying the number of columns of indentation; then this function
2353 gives each line exactly that much indentation, by either adding or
2354 deleting whitespace.
2355
2356 If there is a fill prefix, @code{indent-region} indents each line
2357 by making it start with the fill prefix.
2358 @end deffn
2359
2360 @defvar indent-region-function
2361 The value of this variable is a function that can be used by
2362 @code{indent-region} as a short cut. It should take two arguments, the
2363 start and end of the region. You should design the function so
2364 that it will produce the same results as indenting the lines of the
2365 region one by one, but presumably faster.
2366
2367 If the value is @code{nil}, there is no short cut, and
2368 @code{indent-region} actually works line by line.
2369
2370 A short-cut function is useful in modes such as C mode and Lisp mode,
2371 where the @code{indent-line-function} must scan from the beginning of
2372 the function definition: applying it to each line would be quadratic in
2373 time. The short cut can update the scan information as it moves through
2374 the lines indenting them; this takes linear time. In a mode where
2375 indenting a line individually is fast, there is no need for a short cut.
2376
2377 @code{indent-region} with a non-@code{nil} argument @var{to-column} has
2378 a different meaning and does not use this variable.
2379 @end defvar
2380
2381 @deffn Command indent-rigidly start end count
2382 This function indents all lines starting between @var{start}
2383 (inclusive) and @var{end} (exclusive) sideways by @var{count} columns.
2384 This ``preserves the shape'' of the affected region, moving it as a
2385 rigid unit.
2386
2387 This is useful not only for indenting regions of unindented text, but
2388 also for indenting regions of formatted code. For example, if
2389 @var{count} is 3, this command adds 3 columns of indentation to every
2390 line that begins in the specified region.
2391
2392 If called interactively with no prefix argument, this command invokes
2393 a transient mode for adjusting indentation rigidly. @xref{Indentation
2394 Commands,,, emacs, The GNU Emacs Manual}.
2395 @end deffn
2396
2397 @deffn Command indent-code-rigidly start end columns &optional nochange-regexp
2398 This is like @code{indent-rigidly}, except that it doesn't alter lines
2399 that start within strings or comments.
2400
2401 In addition, it doesn't alter a line if @var{nochange-regexp} matches at
2402 the beginning of the line (if @var{nochange-regexp} is non-@code{nil}).
2403 @end deffn
2404
2405 @node Relative Indent
2406 @subsection Indentation Relative to Previous Lines
2407
2408 This section describes two commands that indent the current line
2409 based on the contents of previous lines.
2410
2411 @deffn Command indent-relative &optional unindented-ok
2412 This command inserts whitespace at point, extending to the same
2413 column as the next @dfn{indent point} of the previous nonblank line. An
2414 indent point is a non-whitespace character following whitespace. The
2415 next indent point is the first one at a column greater than the current
2416 column of point. For example, if point is underneath and to the left of
2417 the first non-blank character of a line of text, it moves to that column
2418 by inserting whitespace.
2419
2420 If the previous nonblank line has no next indent point (i.e., none at a
2421 great enough column position), @code{indent-relative} either does
2422 nothing (if @var{unindented-ok} is non-@code{nil}) or calls
2423 @code{tab-to-tab-stop}. Thus, if point is underneath and to the right
2424 of the last column of a short line of text, this command ordinarily
2425 moves point to the next tab stop by inserting whitespace.
2426
2427 The return value of @code{indent-relative} is unpredictable.
2428
2429 In the following example, point is at the beginning of the second
2430 line:
2431
2432 @example
2433 @group
2434 This line is indented twelve spaces.
2435 @point{}The quick brown fox jumped.
2436 @end group
2437 @end example
2438
2439 @noindent
2440 Evaluation of the expression @code{(indent-relative nil)} produces the
2441 following:
2442
2443 @example
2444 @group
2445 This line is indented twelve spaces.
2446 @point{}The quick brown fox jumped.
2447 @end group
2448 @end example
2449
2450 In this next example, point is between the @samp{m} and @samp{p} of
2451 @samp{jumped}:
2452
2453 @example
2454 @group
2455 This line is indented twelve spaces.
2456 The quick brown fox jum@point{}ped.
2457 @end group
2458 @end example
2459
2460 @noindent
2461 Evaluation of the expression @code{(indent-relative nil)} produces the
2462 following:
2463
2464 @example
2465 @group
2466 This line is indented twelve spaces.
2467 The quick brown fox jum @point{}ped.
2468 @end group
2469 @end example
2470 @end deffn
2471
2472 @deffn Command indent-relative-maybe
2473 @comment !!SourceFile indent.el
2474 This command indents the current line like the previous nonblank line,
2475 by calling @code{indent-relative} with @code{t} as the
2476 @var{unindented-ok} argument. The return value is unpredictable.
2477
2478 If the previous nonblank line has no indent points beyond the current
2479 column, this command does nothing.
2480 @end deffn
2481
2482 @node Indent Tabs
2483 @subsection Adjustable ``Tab Stops''
2484 @cindex tabs stops for indentation
2485
2486 This section explains the mechanism for user-specified ``tab stops''
2487 and the mechanisms that use and set them. The name ``tab stops'' is
2488 used because the feature is similar to that of the tab stops on a
2489 typewriter. The feature works by inserting an appropriate number of
2490 spaces and tab characters to reach the next tab stop column; it does not
2491 affect the display of tab characters in the buffer (@pxref{Usual
2492 Display}). Note that the @key{TAB} character as input uses this tab
2493 stop feature only in a few major modes, such as Text mode.
2494 @xref{Tab Stops,,, emacs, The GNU Emacs Manual}.
2495
2496 @deffn Command tab-to-tab-stop
2497 This command inserts spaces or tabs before point, up to the next tab
2498 stop column defined by @code{tab-stop-list}.
2499 @end deffn
2500
2501 @defopt tab-stop-list
2502 This variable defines the tab stop columns used by @code{tab-to-tab-stop}.
2503 It should be either @code{nil}, or a list of increasing integers,
2504 which need not be evenly spaced. The list is implicitly
2505 extended to infinity through repetition of the interval between the
2506 last and penultimate elements (or @code{tab-width} if the list has
2507 fewer than two elements). A value of @code{nil} means a tab stop
2508 every @code{tab-width} columns.
2509
2510 Use @kbd{M-x edit-tab-stops} to edit the location of tab stops interactively.
2511 @end defopt
2512
2513 @node Motion by Indent
2514 @subsection Indentation-Based Motion Commands
2515
2516 These commands, primarily for interactive use, act based on the
2517 indentation in the text.
2518
2519 @deffn Command back-to-indentation
2520 @comment !!SourceFile simple.el
2521 This command moves point to the first non-whitespace character in the
2522 current line (which is the line in which point is located). It returns
2523 @code{nil}.
2524 @end deffn
2525
2526 @deffn Command backward-to-indentation &optional arg
2527 @comment !!SourceFile simple.el
2528 This command moves point backward @var{arg} lines and then to the
2529 first nonblank character on that line. It returns @code{nil}.
2530 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2531 @end deffn
2532
2533 @deffn Command forward-to-indentation &optional arg
2534 @comment !!SourceFile simple.el
2535 This command moves point forward @var{arg} lines and then to the first
2536 nonblank character on that line. It returns @code{nil}.
2537 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2538 @end deffn
2539
2540 @node Case Changes
2541 @section Case Changes
2542 @cindex case conversion in buffers
2543
2544 The case change commands described here work on text in the current
2545 buffer. @xref{Case Conversion}, for case conversion functions that work
2546 on strings and characters. @xref{Case Tables}, for how to customize
2547 which characters are upper or lower case and how to convert them.
2548
2549 @deffn Command capitalize-region start end
2550 This function capitalizes all words in the region defined by
2551 @var{start} and @var{end}. To capitalize means to convert each word's
2552 first character to upper case and convert the rest of each word to lower
2553 case. The function returns @code{nil}.
2554
2555 If one end of the region is in the middle of a word, the part of the
2556 word within the region is treated as an entire word.
2557
2558 When @code{capitalize-region} is called interactively, @var{start} and
2559 @var{end} are point and the mark, with the smallest first.
2560
2561 @example
2562 @group
2563 ---------- Buffer: foo ----------
2564 This is the contents of the 5th foo.
2565 ---------- Buffer: foo ----------
2566 @end group
2567
2568 @group
2569 (capitalize-region 1 37)
2570 @result{} nil
2571
2572 ---------- Buffer: foo ----------
2573 This Is The Contents Of The 5th Foo.
2574 ---------- Buffer: foo ----------
2575 @end group
2576 @end example
2577 @end deffn
2578
2579 @deffn Command downcase-region start end
2580 This function converts all of the letters in the region defined by
2581 @var{start} and @var{end} to lower case. The function returns
2582 @code{nil}.
2583
2584 When @code{downcase-region} is called interactively, @var{start} and
2585 @var{end} are point and the mark, with the smallest first.
2586 @end deffn
2587
2588 @deffn Command upcase-region start end
2589 This function converts all of the letters in the region defined by
2590 @var{start} and @var{end} to upper case. The function returns
2591 @code{nil}.
2592
2593 When @code{upcase-region} is called interactively, @var{start} and
2594 @var{end} are point and the mark, with the smallest first.
2595 @end deffn
2596
2597 @deffn Command capitalize-word count
2598 This function capitalizes @var{count} words after point, moving point
2599 over as it does. To capitalize means to convert each word's first
2600 character to upper case and convert the rest of each word to lower case.
2601 If @var{count} is negative, the function capitalizes the
2602 @minus{}@var{count} previous words but does not move point. The value
2603 is @code{nil}.
2604
2605 If point is in the middle of a word, the part of the word before point
2606 is ignored when moving forward. The rest is treated as an entire word.
2607
2608 When @code{capitalize-word} is called interactively, @var{count} is
2609 set to the numeric prefix argument.
2610 @end deffn
2611
2612 @deffn Command downcase-word count
2613 This function converts the @var{count} words after point to all lower
2614 case, moving point over as it does. If @var{count} is negative, it
2615 converts the @minus{}@var{count} previous words but does not move point.
2616 The value is @code{nil}.
2617
2618 When @code{downcase-word} is called interactively, @var{count} is set
2619 to the numeric prefix argument.
2620 @end deffn
2621
2622 @deffn Command upcase-word count
2623 This function converts the @var{count} words after point to all upper
2624 case, moving point over as it does. If @var{count} is negative, it
2625 converts the @minus{}@var{count} previous words but does not move point.
2626 The value is @code{nil}.
2627
2628 When @code{upcase-word} is called interactively, @var{count} is set to
2629 the numeric prefix argument.
2630 @end deffn
2631
2632 @node Text Properties
2633 @section Text Properties
2634 @cindex text properties
2635 @cindex attributes of text
2636 @cindex properties of text
2637
2638 Each character position in a buffer or a string can have a @dfn{text
2639 property list}, much like the property list of a symbol (@pxref{Property
2640 Lists}). The properties belong to a particular character at a
2641 particular place, such as, the letter @samp{T} at the beginning of this
2642 sentence or the first @samp{o} in @samp{foo}---if the same character
2643 occurs in two different places, the two occurrences in general have
2644 different properties.
2645
2646 Each property has a name and a value. Both of these can be any Lisp
2647 object, but the name is normally a symbol. Typically each property
2648 name symbol is used for a particular purpose; for instance, the text
2649 property @code{face} specifies the faces for displaying the character
2650 (@pxref{Special Properties}). The usual way to access the property
2651 list is to specify a name and ask what value corresponds to it.
2652
2653 If a character has a @code{category} property, we call it the
2654 @dfn{property category} of the character. It should be a symbol. The
2655 properties of the symbol serve as defaults for the properties of the
2656 character.
2657
2658 Copying text between strings and buffers preserves the properties
2659 along with the characters; this includes such diverse functions as
2660 @code{substring}, @code{insert}, and @code{buffer-substring}.
2661
2662 @menu
2663 * Examining Properties:: Looking at the properties of one character.
2664 * Changing Properties:: Setting the properties of a range of text.
2665 * Property Search:: Searching for where a property changes value.
2666 * Special Properties:: Particular properties with special meanings.
2667 * Format Properties:: Properties for representing formatting of text.
2668 * Sticky Properties:: How inserted text gets properties from
2669 neighboring text.
2670 * Lazy Properties:: Computing text properties in a lazy fashion
2671 only when text is examined.
2672 * Clickable Text:: Using text properties to make regions of text
2673 do something when you click on them.
2674 * Fields:: The @code{field} property defines
2675 fields within the buffer.
2676 * Not Intervals:: Why text properties do not use
2677 Lisp-visible text intervals.
2678 @end menu
2679
2680 @node Examining Properties
2681 @subsection Examining Text Properties
2682
2683 The simplest way to examine text properties is to ask for the value of
2684 a particular property of a particular character. For that, use
2685 @code{get-text-property}. Use @code{text-properties-at} to get the
2686 entire property list of a character. @xref{Property Search}, for
2687 functions to examine the properties of a number of characters at once.
2688
2689 These functions handle both strings and buffers. Keep in mind that
2690 positions in a string start from 0, whereas positions in a buffer start
2691 from 1.
2692
2693 @defun get-text-property pos prop &optional object
2694 This function returns the value of the @var{prop} property of the
2695 character after position @var{pos} in @var{object} (a buffer or
2696 string). The argument @var{object} is optional and defaults to the
2697 current buffer.
2698
2699 If there is no @var{prop} property strictly speaking, but the character
2700 has a property category that is a symbol, then @code{get-text-property} returns
2701 the @var{prop} property of that symbol.
2702 @end defun
2703
2704 @defun get-char-property position prop &optional object
2705 This function is like @code{get-text-property}, except that it checks
2706 overlays first and then text properties. @xref{Overlays}.
2707
2708 The argument @var{object} may be a string, a buffer, or a window. If
2709 it is a window, then the buffer displayed in that window is used for
2710 text properties and overlays, but only the overlays active for that
2711 window are considered. If @var{object} is a buffer, then overlays in
2712 that buffer are considered first, in order of decreasing priority,
2713 followed by the text properties. If @var{object} is a string, only
2714 text properties are considered, since strings never have overlays.
2715 @end defun
2716
2717 @defun get-pos-property position prop &optional object
2718 This function is like @code{get-char-property}, except that it pays
2719 attention to properties' stickiness and overlays' advancement settings
2720 instead of the property of the character at (i.e. right after)
2721 @var{position}.
2722 @end defun
2723
2724 @defun get-char-property-and-overlay position prop &optional object
2725 This is like @code{get-char-property}, but gives extra information
2726 about the overlay that the property value comes from.
2727
2728 Its value is a cons cell whose @sc{car} is the property value, the
2729 same value @code{get-char-property} would return with the same
2730 arguments. Its @sc{cdr} is the overlay in which the property was
2731 found, or @code{nil}, if it was found as a text property or not found
2732 at all.
2733
2734 If @var{position} is at the end of @var{object}, both the @sc{car} and
2735 the @sc{cdr} of the value are @code{nil}.
2736 @end defun
2737
2738 @defvar char-property-alias-alist
2739 This variable holds an alist which maps property names to a list of
2740 alternative property names. If a character does not specify a direct
2741 value for a property, the alternative property names are consulted in
2742 order; the first non-@code{nil} value is used. This variable takes
2743 precedence over @code{default-text-properties}, and @code{category}
2744 properties take precedence over this variable.
2745 @end defvar
2746
2747 @defun text-properties-at position &optional object
2748 This function returns the entire property list of the character at
2749 @var{position} in the string or buffer @var{object}. If @var{object} is
2750 @code{nil}, it defaults to the current buffer.
2751 @end defun
2752
2753 @defvar default-text-properties
2754 This variable holds a property list giving default values for text
2755 properties. Whenever a character does not specify a value for a
2756 property, neither directly, through a category symbol, or through
2757 @code{char-property-alias-alist}, the value stored in this list is
2758 used instead. Here is an example:
2759
2760 @example
2761 (setq default-text-properties '(foo 69)
2762 char-property-alias-alist nil)
2763 ;; @r{Make sure character 1 has no properties of its own.}
2764 (set-text-properties 1 2 nil)
2765 ;; @r{What we get, when we ask, is the default value.}
2766 (get-text-property 1 'foo)
2767 @result{} 69
2768 @end example
2769 @end defvar
2770
2771 @node Changing Properties
2772 @subsection Changing Text Properties
2773
2774 The primitives for changing properties apply to a specified range of
2775 text in a buffer or string. The function @code{set-text-properties}
2776 (see end of section) sets the entire property list of the text in that
2777 range; more often, it is useful to add, change, or delete just certain
2778 properties specified by name.
2779
2780 Since text properties are considered part of the contents of the
2781 buffer (or string), and can affect how a buffer looks on the screen,
2782 any change in buffer text properties marks the buffer as modified.
2783 Buffer text property changes are undoable also (@pxref{Undo}).
2784 Positions in a string start from 0, whereas positions in a buffer
2785 start from 1.
2786
2787 @defun put-text-property start end prop value &optional object
2788 This function sets the @var{prop} property to @var{value} for the text
2789 between @var{start} and @var{end} in the string or buffer @var{object}.
2790 If @var{object} is @code{nil}, it defaults to the current buffer.
2791 @end defun
2792
2793 @defun add-text-properties start end props &optional object
2794 This function adds or overrides text properties for the text between
2795 @var{start} and @var{end} in the string or buffer @var{object}. If
2796 @var{object} is @code{nil}, it defaults to the current buffer.
2797
2798 The argument @var{props} specifies which properties to add. It should
2799 have the form of a property list (@pxref{Property Lists}): a list whose
2800 elements include the property names followed alternately by the
2801 corresponding values.
2802
2803 The return value is @code{t} if the function actually changed some
2804 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2805 its values agree with those in the text).
2806
2807 For example, here is how to set the @code{comment} and @code{face}
2808 properties of a range of text:
2809
2810 @example
2811 (add-text-properties @var{start} @var{end}
2812 '(comment t face highlight))
2813 @end example
2814 @end defun
2815
2816 @defun remove-text-properties start end props &optional object
2817 This function deletes specified text properties from the text between
2818 @var{start} and @var{end} in the string or buffer @var{object}. If
2819 @var{object} is @code{nil}, it defaults to the current buffer.
2820
2821 The argument @var{props} specifies which properties to delete. It
2822 should have the form of a property list (@pxref{Property Lists}): a list
2823 whose elements are property names alternating with corresponding values.
2824 But only the names matter---the values that accompany them are ignored.
2825 For example, here's how to remove the @code{face} property.
2826
2827 @example
2828 (remove-text-properties @var{start} @var{end} '(face nil))
2829 @end example
2830
2831 The return value is @code{t} if the function actually changed some
2832 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2833 if no character in the specified text had any of those properties).
2834
2835 To remove all text properties from certain text, use
2836 @code{set-text-properties} and specify @code{nil} for the new property
2837 list.
2838 @end defun
2839
2840 @defun remove-list-of-text-properties start end list-of-properties &optional object
2841 Like @code{remove-text-properties} except that
2842 @var{list-of-properties} is a list of property names only, not an
2843 alternating list of property names and values.
2844 @end defun
2845
2846 @defun set-text-properties start end props &optional object
2847 This function completely replaces the text property list for the text
2848 between @var{start} and @var{end} in the string or buffer @var{object}.
2849 If @var{object} is @code{nil}, it defaults to the current buffer.
2850
2851 The argument @var{props} is the new property list. It should be a list
2852 whose elements are property names alternating with corresponding values.
2853
2854 After @code{set-text-properties} returns, all the characters in the
2855 specified range have identical properties.
2856
2857 If @var{props} is @code{nil}, the effect is to get rid of all properties
2858 from the specified range of text. Here's an example:
2859
2860 @example
2861 (set-text-properties @var{start} @var{end} nil)
2862 @end example
2863
2864 Do not rely on the return value of this function.
2865 @end defun
2866
2867 @defun add-face-text-property start end face &optional appendp object
2868 This function acts on the text between @var{start} and @var{end},
2869 adding the face @var{face} to the @code{face} text property.
2870 @var{face} should be a valid value for the @code{face} property
2871 (@pxref{Special Properties}), such as a face name or an anonymous face
2872 (@pxref{Faces}).
2873
2874 If any text in the region already has a non-@code{nil} @code{face} property,
2875 those face(s) are retained. This function sets the @code{face}
2876 property to a list of faces, with @var{face} as the first element (by
2877 default) and the pre-existing faces as the remaining elements. If the
2878 optional argument @var{append} is non-@code{nil}, @var{face} is
2879 appended to the end of the list instead. Note that in a face list,
2880 the first occurring value for each attribute takes precedence.
2881
2882 For example, the following code would assign a italicized green face
2883 to the text between @var{start} and @var{end}:
2884
2885 @example
2886 (add-face-text-property @var{start} @var{end} 'italic)
2887 (add-face-text-property @var{start} @var{end} '(:foreground "red"))
2888 (add-face-text-property @var{start} @var{end} '(:foreground "green"))
2889 @end example
2890
2891 The optional argument @var{object}, if non-@code{nil}, specifies a
2892 buffer or string to act on, rather than the current buffer. If
2893 @var{object} is a string, then @var{start} and @var{end} are
2894 zero-based indices into the string.
2895 @end defun
2896
2897 The easiest way to make a string with text properties is with
2898 @code{propertize}:
2899
2900 @defun propertize string &rest properties
2901 This function returns a copy of @var{string} with the text properties
2902 @var{properties} added. These properties apply to all the characters
2903 in the string that is returned. Here is an example that constructs a
2904 string with a @code{face} property and a @code{mouse-face} property:
2905
2906 @smallexample
2907 (propertize "foo" 'face 'italic
2908 'mouse-face 'bold-italic)
2909 @result{} #("foo" 0 3 (mouse-face bold-italic face italic))
2910 @end smallexample
2911
2912 To put different properties on various parts of a string, you can
2913 construct each part with @code{propertize} and then combine them with
2914 @code{concat}:
2915
2916 @smallexample
2917 (concat
2918 (propertize "foo" 'face 'italic
2919 'mouse-face 'bold-italic)
2920 " and "
2921 (propertize "bar" 'face 'italic
2922 'mouse-face 'bold-italic))
2923 @result{} #("foo and bar"
2924 0 3 (face italic mouse-face bold-italic)
2925 3 8 nil
2926 8 11 (face italic mouse-face bold-italic))
2927 @end smallexample
2928 @end defun
2929
2930 @xref{Buffer Contents}, for the function
2931 @code{buffer-substring-no-properties}, which copies text from the
2932 buffer but does not copy its properties.
2933
2934 @node Property Search
2935 @subsection Text Property Search Functions
2936
2937 In typical use of text properties, most of the time several or many
2938 consecutive characters have the same value for a property. Rather than
2939 writing your programs to examine characters one by one, it is much
2940 faster to process chunks of text that have the same property value.
2941
2942 Here are functions you can use to do this. They use @code{eq} for
2943 comparing property values. In all cases, @var{object} defaults to the
2944 current buffer.
2945
2946 For good performance, it's very important to use the @var{limit}
2947 argument to these functions, especially the ones that search for a
2948 single property---otherwise, they may spend a long time scanning to the
2949 end of the buffer, if the property you are interested in does not change.
2950
2951 These functions do not move point; instead, they return a position (or
2952 @code{nil}). Remember that a position is always between two characters;
2953 the position returned by these functions is between two characters with
2954 different properties.
2955
2956 @defun next-property-change pos &optional object limit
2957 The function scans the text forward from position @var{pos} in the
2958 string or buffer @var{object} until it finds a change in some text
2959 property, then returns the position of the change. In other words, it
2960 returns the position of the first character beyond @var{pos} whose
2961 properties are not identical to those of the character just after
2962 @var{pos}.
2963
2964 If @var{limit} is non-@code{nil}, then the scan ends at position
2965 @var{limit}. If there is no property change before that point, this
2966 function returns @var{limit}.
2967
2968 The value is @code{nil} if the properties remain unchanged all the way
2969 to the end of @var{object} and @var{limit} is @code{nil}. If the value
2970 is non-@code{nil}, it is a position greater than or equal to @var{pos}.
2971 The value equals @var{pos} only when @var{limit} equals @var{pos}.
2972
2973 Here is an example of how to scan the buffer by chunks of text within
2974 which all properties are constant:
2975
2976 @smallexample
2977 (while (not (eobp))
2978 (let ((plist (text-properties-at (point)))
2979 (next-change
2980 (or (next-property-change (point) (current-buffer))
2981 (point-max))))
2982 @r{Process text from point to @var{next-change}@dots{}}
2983 (goto-char next-change)))
2984 @end smallexample
2985 @end defun
2986
2987 @defun previous-property-change pos &optional object limit
2988 This is like @code{next-property-change}, but scans back from @var{pos}
2989 instead of forward. If the value is non-@code{nil}, it is a position
2990 less than or equal to @var{pos}; it equals @var{pos} only if @var{limit}
2991 equals @var{pos}.
2992 @end defun
2993
2994 @defun next-single-property-change pos prop &optional object limit
2995 The function scans text for a change in the @var{prop} property, then
2996 returns the position of the change. The scan goes forward from
2997 position @var{pos} in the string or buffer @var{object}. In other
2998 words, this function returns the position of the first character
2999 beyond @var{pos} whose @var{prop} property differs from that of the
3000 character just after @var{pos}.
3001
3002 If @var{limit} is non-@code{nil}, then the scan ends at position
3003 @var{limit}. If there is no property change before that point,
3004 @code{next-single-property-change} returns @var{limit}.
3005
3006 The value is @code{nil} if the property remains unchanged all the way to
3007 the end of @var{object} and @var{limit} is @code{nil}. If the value is
3008 non-@code{nil}, it is a position greater than or equal to @var{pos}; it
3009 equals @var{pos} only if @var{limit} equals @var{pos}.
3010 @end defun
3011
3012 @defun previous-single-property-change pos prop &optional object limit
3013 This is like @code{next-single-property-change}, but scans back from
3014 @var{pos} instead of forward. If the value is non-@code{nil}, it is a
3015 position less than or equal to @var{pos}; it equals @var{pos} only if
3016 @var{limit} equals @var{pos}.
3017 @end defun
3018
3019 @defun next-char-property-change pos &optional limit
3020 This is like @code{next-property-change} except that it considers
3021 overlay properties as well as text properties, and if no change is
3022 found before the end of the buffer, it returns the maximum buffer
3023 position rather than @code{nil} (in this sense, it resembles the
3024 corresponding overlay function @code{next-overlay-change}, rather than
3025 @code{next-property-change}). There is no @var{object} operand
3026 because this function operates only on the current buffer. It returns
3027 the next address at which either kind of property changes.
3028 @end defun
3029
3030 @defun previous-char-property-change pos &optional limit
3031 This is like @code{next-char-property-change}, but scans back from
3032 @var{pos} instead of forward, and returns the minimum buffer
3033 position if no change is found.
3034 @end defun
3035
3036 @defun next-single-char-property-change pos prop &optional object limit
3037 This is like @code{next-single-property-change} except that it
3038 considers overlay properties as well as text properties, and if no
3039 change is found before the end of the @var{object}, it returns the
3040 maximum valid position in @var{object} rather than @code{nil}. Unlike
3041 @code{next-char-property-change}, this function @emph{does} have an
3042 @var{object} operand; if @var{object} is not a buffer, only
3043 text-properties are considered.
3044 @end defun
3045
3046 @defun previous-single-char-property-change pos prop &optional object limit
3047 This is like @code{next-single-char-property-change}, but scans back
3048 from @var{pos} instead of forward, and returns the minimum valid
3049 position in @var{object} if no change is found.
3050 @end defun
3051
3052 @defun text-property-any start end prop value &optional object
3053 This function returns non-@code{nil} if at least one character between
3054 @var{start} and @var{end} has a property @var{prop} whose value is
3055 @var{value}. More precisely, it returns the position of the first such
3056 character. Otherwise, it returns @code{nil}.
3057
3058 The optional fifth argument, @var{object}, specifies the string or
3059 buffer to scan. Positions are relative to @var{object}. The default
3060 for @var{object} is the current buffer.
3061 @end defun
3062
3063 @defun text-property-not-all start end prop value &optional object
3064 This function returns non-@code{nil} if at least one character between
3065 @var{start} and @var{end} does not have a property @var{prop} with value
3066 @var{value}. More precisely, it returns the position of the first such
3067 character. Otherwise, it returns @code{nil}.
3068
3069 The optional fifth argument, @var{object}, specifies the string or
3070 buffer to scan. Positions are relative to @var{object}. The default
3071 for @var{object} is the current buffer.
3072 @end defun
3073
3074 @node Special Properties
3075 @subsection Properties with Special Meanings
3076
3077 Here is a table of text property names that have special built-in
3078 meanings. The following sections list a few additional special property
3079 names that control filling and property inheritance. All other names
3080 have no standard meaning, and you can use them as you like.
3081
3082 Note: the properties @code{composition}, @code{display},
3083 @code{invisible} and @code{intangible} can also cause point to move to
3084 an acceptable place, after each Emacs command. @xref{Adjusting
3085 Point}.
3086
3087 @table @code
3088 @cindex property category of text character
3089 @c FIXME: Isn't @kindex for keyboard commands?
3090 @kindex category @r{(text property)}
3091 @item category
3092 If a character has a @code{category} property, we call it the
3093 @dfn{property category} of the character. It should be a symbol. The
3094 properties of this symbol serve as defaults for the properties of the
3095 character.
3096
3097 @item face
3098 @cindex face codes of text
3099 @kindex face @r{(text property)}
3100 The @code{face} property controls the appearance of the character
3101 (@pxref{Faces}). The value of the property can be the following:
3102
3103 @itemize @bullet
3104 @item
3105 A face name (a symbol or string).
3106
3107 @item
3108 An anonymous face: a property list of the form @code{(@var{keyword}
3109 @var{value} @dots{})}, where each @var{keyword} is a face attribute
3110 name and @var{value} is a value for that attribute.
3111
3112 @item
3113 A list of faces. Each list element should be either a face name or an
3114 anonymous face. This specifies a face which is an aggregate of the
3115 attributes of each of the listed faces. Faces occurring earlier in
3116 the list have higher priority.
3117
3118 @item
3119 A cons cell of the form @code{(foreground-color . @var{color-name})}
3120 or @code{(background-color . @var{color-name})}. This specifies the
3121 foreground or background color, similar to @code{(:foreground
3122 @var{color-name})} or @code{(:background @var{color-name})}. This
3123 form is supported for backward compatibility only, and should be
3124 avoided.
3125 @end itemize
3126
3127 Font Lock mode (@pxref{Font Lock Mode}) works in most buffers by
3128 dynamically updating the @code{face} property of characters based on
3129 the context.
3130
3131 The @code{add-face-text-property} function provides a convenient way
3132 to set this text property. @xref{Changing Properties}.
3133
3134 @item font-lock-face
3135 @kindex font-lock-face @r{(text property)}
3136 This property specifies a value for the @code{face} property that Font
3137 Lock mode should apply to the underlying text. It is one of the
3138 fontification methods used by Font Lock mode, and is useful for
3139 special modes that implement their own highlighting.
3140 @xref{Precalculated Fontification}. When Font Lock mode is disabled,
3141 @code{font-lock-face} has no effect.
3142
3143 @item mouse-face
3144 @kindex mouse-face @r{(text property)}
3145 This property is used instead of @code{face} when the mouse is on or
3146 near the character. For this purpose, ``near'' means that all text
3147 between the character and where the mouse is have the same
3148 @code{mouse-face} property value.
3149
3150 Emacs ignores all face attributes from the @code{mouse-face} property
3151 that alter the text size (e.g., @code{:height}, @code{:weight}, and
3152 @code{:slant}). Those attributes are always the same as for the
3153 unhighlighted text.
3154
3155 @item fontified
3156 @kindex fontified @r{(text property)}
3157 This property says whether the text is ready for display. If
3158 @code{nil}, Emacs's redisplay routine calls the functions in
3159 @code{fontification-functions} (@pxref{Auto Faces}) to prepare this
3160 part of the buffer before it is displayed. It is used internally by
3161 the ``just in time'' font locking code.
3162
3163 @item display
3164 This property activates various features that change the
3165 way text is displayed. For example, it can make text appear taller
3166 or shorter, higher or lower, wider or narrow, or replaced with an image.
3167 @xref{Display Property}.
3168
3169 @item help-echo
3170 @kindex help-echo @r{(text property)}
3171 @cindex tooltip
3172 @anchor{Text help-echo}
3173 If text has a string as its @code{help-echo} property, then when you
3174 move the mouse onto that text, Emacs displays that string in the echo
3175 area, or in the tooltip window (@pxref{Tooltips,,, emacs, The GNU Emacs
3176 Manual}).
3177
3178 If the value of the @code{help-echo} property is a function, that
3179 function is called with three arguments, @var{window}, @var{object} and
3180 @var{pos} and should return a help string or @code{nil} for
3181 none. The first argument, @var{window} is the window in which
3182 the help was found. The second, @var{object}, is the buffer, overlay or
3183 string which had the @code{help-echo} property. The @var{pos}
3184 argument is as follows:
3185
3186 @itemize @bullet{}
3187 @item
3188 If @var{object} is a buffer, @var{pos} is the position in the buffer.
3189 @item
3190 If @var{object} is an overlay, that overlay has a @code{help-echo}
3191 property, and @var{pos} is the position in the overlay's buffer.
3192 @item
3193 If @var{object} is a string (an overlay string or a string displayed
3194 with the @code{display} property), @var{pos} is the position in that
3195 string.
3196 @end itemize
3197
3198 If the value of the @code{help-echo} property is neither a function nor
3199 a string, it is evaluated to obtain a help string.
3200
3201 You can alter the way help text is displayed by setting the variable
3202 @code{show-help-function} (@pxref{Help display}).
3203
3204 This feature is used in the mode line and for other active text.
3205
3206 @item keymap
3207 @cindex keymap of character
3208 @kindex keymap @r{(text property)}
3209 The @code{keymap} property specifies an additional keymap for
3210 commands. When this keymap applies, it is used for key lookup before
3211 the minor mode keymaps and before the buffer's local map.
3212 @xref{Active Keymaps}. If the property value is a symbol, the
3213 symbol's function definition is used as the keymap.
3214
3215 The property's value for the character before point applies if it is
3216 non-@code{nil} and rear-sticky, and the property's value for the
3217 character after point applies if it is non-@code{nil} and
3218 front-sticky. (For mouse clicks, the position of the click is used
3219 instead of the position of point.)
3220
3221 @item local-map
3222 @kindex local-map @r{(text property)}
3223 This property works like @code{keymap} except that it specifies a
3224 keymap to use @emph{instead of} the buffer's local map. For most
3225 purposes (perhaps all purposes), it is better to use the @code{keymap}
3226 property.
3227
3228 @item syntax-table
3229 The @code{syntax-table} property overrides what the syntax table says
3230 about this particular character. @xref{Syntax Properties}.
3231
3232 @item read-only
3233 @cindex read-only character
3234 @kindex read-only @r{(text property)}
3235 If a character has the property @code{read-only}, then modifying that
3236 character is not allowed. Any command that would do so gets an error,
3237 @code{text-read-only}. If the property value is a string, that string
3238 is used as the error message.
3239
3240 Insertion next to a read-only character is an error if inserting
3241 ordinary text there would inherit the @code{read-only} property due to
3242 stickiness. Thus, you can control permission to insert next to
3243 read-only text by controlling the stickiness. @xref{Sticky Properties}.
3244
3245 Since changing properties counts as modifying the buffer, it is not
3246 possible to remove a @code{read-only} property unless you know the
3247 special trick: bind @code{inhibit-read-only} to a non-@code{nil} value
3248 and then remove the property. @xref{Read Only Buffers}.
3249
3250 @item inhibit-read-only
3251 @kindex inhibit-read-only @r{(text property)}
3252 If a character has the property @code{inhibit-read-only}, and the
3253 buffer is read-only, editing the character in question is allowed.
3254
3255 @item invisible
3256 @kindex invisible @r{(text property)}
3257 A non-@code{nil} @code{invisible} property can make a character invisible
3258 on the screen. @xref{Invisible Text}, for details.
3259
3260 @item intangible
3261 @kindex intangible @r{(text property)}
3262 If a group of consecutive characters have equal and non-@code{nil}
3263 @code{intangible} properties, then you cannot place point between them.
3264 If you try to move point forward into the group, point actually moves to
3265 the end of the group. If you try to move point backward into the group,
3266 point actually moves to the start of the group.
3267
3268 If consecutive characters have unequal non-@code{nil}
3269 @code{intangible} properties, they belong to separate groups; each
3270 group is separately treated as described above.
3271
3272 When the variable @code{inhibit-point-motion-hooks} is non-@code{nil},
3273 the @code{intangible} property is ignored.
3274
3275 Beware: this property operates at a very low level, and affects a lot of code
3276 in unexpected ways. So use it with extreme caution. A common misuse is to put
3277 an intangible property on invisible text, which is actually unnecessary since
3278 the command loop will move point outside of the invisible text at the end of
3279 each command anyway. @xref{Adjusting Point}.
3280
3281 @item field
3282 @kindex field @r{(text property)}
3283 Consecutive characters with the same @code{field} property constitute a
3284 @dfn{field}. Some motion functions including @code{forward-word} and
3285 @code{beginning-of-line} stop moving at a field boundary.
3286 @xref{Fields}.
3287
3288 @item cursor
3289 @kindex cursor @r{(text property)}
3290 Normally, the cursor is displayed at the beginning or the end of any
3291 overlay and text property strings present at the current buffer
3292 position. You can place the cursor on any desired character of these
3293 strings by giving that character a non-@code{nil} @code{cursor} text
3294 property. In addition, if the value of the @code{cursor} property is
3295 an integer, it specifies the number of buffer's character
3296 positions, starting with the position where the overlay or the
3297 @code{display} property begins, for which the cursor should be
3298 displayed on that character. Specifically, if the value of the
3299 @code{cursor} property of a character is the number @var{n}, the
3300 cursor will be displayed on this character for any buffer position in
3301 the range @code{[@var{ovpos}..@var{ovpos}+@var{n})}, where @var{ovpos}
3302 is the overlay's starting position given by @code{overlay-start}
3303 (@pxref{Managing Overlays}), or the position where the @code{display}
3304 text property begins in the buffer.
3305
3306 In other words, the string character with the @code{cursor} property
3307 of any non-@code{nil} value is the character where to display the
3308 cursor. The value of the property says for which buffer positions to
3309 display the cursor there. If the value is an integer @var{n},
3310 the cursor is displayed there when point is anywhere between the
3311 beginning of the overlay or @code{display} property and @var{n}
3312 positions after that. If the value is anything else and
3313 non-@code{nil}, the cursor is displayed there only when point is at
3314 the beginning of the @code{display} property or at
3315 @code{overlay-start}.
3316
3317 @cindex cursor position for @code{display} properties and overlays
3318 When the buffer has many overlay strings (e.g., @pxref{Overlay
3319 Properties, before-string}) or @code{display} properties that are
3320 strings, it is a good idea to use the @code{cursor} property on these
3321 strings to cue the Emacs display about the places where to put the
3322 cursor while traversing these strings. This directly communicates to
3323 the display engine where the Lisp program wants to put the cursor, or
3324 where the user would expect the cursor.
3325
3326 @item pointer
3327 @kindex pointer @r{(text property)}
3328 This specifies a specific pointer shape when the mouse pointer is over
3329 this text or image. @xref{Pointer Shape}, for possible pointer
3330 shapes.
3331
3332 @item line-spacing
3333 @kindex line-spacing @r{(text property)}
3334 A newline can have a @code{line-spacing} text or overlay property that
3335 controls the height of the display line ending with that newline. The
3336 property value overrides the default frame line spacing and the buffer
3337 local @code{line-spacing} variable. @xref{Line Height}.
3338
3339 @item line-height
3340 @kindex line-height @r{(text property)}
3341 A newline can have a @code{line-height} text or overlay property that
3342 controls the total height of the display line ending in that newline.
3343 @xref{Line Height}.
3344
3345 @item wrap-prefix
3346 If text has a @code{wrap-prefix} property, the prefix it defines will
3347 be added at display time to the beginning of every continuation line
3348 due to text wrapping (so if lines are truncated, the wrap-prefix is
3349 never used). It may be a string or an image (@pxref{Other Display
3350 Specs}), or a stretch of whitespace such as specified by the
3351 @code{:width} or @code{:align-to} display properties (@pxref{Specified
3352 Space}).
3353
3354 A wrap-prefix may also be specified for an entire buffer using the
3355 @code{wrap-prefix} buffer-local variable (however, a
3356 @code{wrap-prefix} text-property takes precedence over the value of
3357 the @code{wrap-prefix} variable). @xref{Truncation}.
3358
3359 @item line-prefix
3360 If text has a @code{line-prefix} property, the prefix it defines will
3361 be added at display time to the beginning of every non-continuation
3362 line. It may be a string or an image (@pxref{Other Display
3363 Specs}), or a stretch of whitespace such as specified by the
3364 @code{:width} or @code{:align-to} display properties (@pxref{Specified
3365 Space}).
3366
3367 A line-prefix may also be specified for an entire buffer using the
3368 @code{line-prefix} buffer-local variable (however, a
3369 @code{line-prefix} text-property takes precedence over the value of
3370 the @code{line-prefix} variable). @xref{Truncation}.
3371
3372 @item modification-hooks
3373 @cindex change hooks for a character
3374 @cindex hooks for changing a character
3375 @kindex modification-hooks @r{(text property)}
3376 If a character has the property @code{modification-hooks}, then its
3377 value should be a list of functions; modifying that character calls
3378 all of those functions before the actual modification. Each function
3379 receives two arguments: the beginning and end of the part of the
3380 buffer being modified. Note that if a particular modification hook
3381 function appears on several characters being modified by a single
3382 primitive, you can't predict how many times the function will
3383 be called.
3384 Furthermore, insertion will not modify any existing character, so this
3385 hook will only be run when removing some characters, replacing them
3386 with others, or changing their text-properties.
3387
3388 If these functions modify the buffer, they should bind
3389 @code{inhibit-modification-hooks} to @code{t} around doing so, to
3390 avoid confusing the internal mechanism that calls these hooks.
3391
3392 Overlays also support the @code{modification-hooks} property, but the
3393 details are somewhat different (@pxref{Overlay Properties}).
3394
3395 @item insert-in-front-hooks
3396 @itemx insert-behind-hooks
3397 @kindex insert-in-front-hooks @r{(text property)}
3398 @kindex insert-behind-hooks @r{(text property)}
3399 The operation of inserting text in a buffer also calls the functions
3400 listed in the @code{insert-in-front-hooks} property of the following
3401 character and in the @code{insert-behind-hooks} property of the
3402 preceding character. These functions receive two arguments, the
3403 beginning and end of the inserted text. The functions are called
3404 @emph{after} the actual insertion takes place.
3405
3406 See also @ref{Change Hooks}, for other hooks that are called
3407 when you change text in a buffer.
3408
3409 @item point-entered
3410 @itemx point-left
3411 @cindex hooks for motion of point
3412 @kindex point-entered @r{(text property)}
3413 @kindex point-left @r{(text property)}
3414 The special properties @code{point-entered} and @code{point-left}
3415 record hook functions that report motion of point. Each time point
3416 moves, Emacs compares these two property values:
3417
3418 @itemize @bullet
3419 @item
3420 the @code{point-left} property of the character after the old location,
3421 and
3422 @item
3423 the @code{point-entered} property of the character after the new
3424 location.
3425 @end itemize
3426
3427 @noindent
3428 If these two values differ, each of them is called (if not @code{nil})
3429 with two arguments: the old value of point, and the new one.
3430
3431 The same comparison is made for the characters before the old and new
3432 locations. The result may be to execute two @code{point-left} functions
3433 (which may be the same function) and/or two @code{point-entered}
3434 functions (which may be the same function). In any case, all the
3435 @code{point-left} functions are called first, followed by all the
3436 @code{point-entered} functions.
3437
3438 It is possible to use @code{char-after} to examine characters at various
3439 buffer positions without moving point to those positions. Only an
3440 actual change in the value of point runs these hook functions.
3441
3442 The variable @code{inhibit-point-motion-hooks} can inhibit running the
3443 @code{point-left} and @code{point-entered} hooks, see @ref{Inhibit
3444 point motion hooks}.
3445
3446 @item composition
3447 @kindex composition @r{(text property)}
3448 This text property is used to display a sequence of characters as a
3449 single glyph composed from components. But the value of the property
3450 itself is completely internal to Emacs and should not be manipulated
3451 directly by, for instance, @code{put-text-property}.
3452
3453 @end table
3454
3455 @defvar inhibit-point-motion-hooks
3456 @anchor{Inhibit point motion hooks} When this variable is
3457 non-@code{nil}, @code{point-left} and @code{point-entered} hooks are
3458 not run, and the @code{intangible} property has no effect. Do not set
3459 this variable globally; bind it with @code{let}.
3460 @end defvar
3461
3462 @defvar show-help-function
3463 @anchor{Help display} If this variable is non-@code{nil}, it specifies a
3464 function called to display help strings. These may be @code{help-echo}
3465 properties, menu help strings (@pxref{Simple Menu Items},
3466 @pxref{Extended Menu Items}), or tool bar help strings (@pxref{Tool
3467 Bar}). The specified function is called with one argument, the help
3468 string to display. Tooltip mode (@pxref{Tooltips,,, emacs, The GNU Emacs
3469 Manual}) provides an example.
3470 @end defvar
3471
3472 @node Format Properties
3473 @subsection Formatted Text Properties
3474
3475 These text properties affect the behavior of the fill commands. They
3476 are used for representing formatted text. @xref{Filling}, and
3477 @ref{Margins}.
3478
3479 @table @code
3480 @item hard
3481 If a newline character has this property, it is a ``hard'' newline.
3482 The fill commands do not alter hard newlines and do not move words
3483 across them. However, this property takes effect only if the
3484 @code{use-hard-newlines} minor mode is enabled. @xref{Hard and Soft
3485 Newlines,, Hard and Soft Newlines, emacs, The GNU Emacs Manual}.
3486
3487 @item right-margin
3488 This property specifies an extra right margin for filling this part of the
3489 text.
3490
3491 @item left-margin
3492 This property specifies an extra left margin for filling this part of the
3493 text.
3494
3495 @item justification
3496 This property specifies the style of justification for filling this part
3497 of the text.
3498 @end table
3499
3500 @node Sticky Properties
3501 @subsection Stickiness of Text Properties
3502 @cindex sticky text properties
3503 @cindex inheritance, text property
3504
3505 Self-inserting characters normally take on the same properties as the
3506 preceding character. This is called @dfn{inheritance} of properties.
3507
3508 A Lisp program can do insertion with inheritance or without,
3509 depending on the choice of insertion primitive. The ordinary text
3510 insertion functions, such as @code{insert}, do not inherit any
3511 properties. They insert text with precisely the properties of the
3512 string being inserted, and no others. This is correct for programs
3513 that copy text from one context to another---for example, into or out
3514 of the kill ring. To insert with inheritance, use the special
3515 primitives described in this section. Self-inserting characters
3516 inherit properties because they work using these primitives.
3517
3518 When you do insertion with inheritance, @emph{which} properties are
3519 inherited, and from where, depends on which properties are @dfn{sticky}.
3520 Insertion after a character inherits those of its properties that are
3521 @dfn{rear-sticky}. Insertion before a character inherits those of its
3522 properties that are @dfn{front-sticky}. When both sides offer different
3523 sticky values for the same property, the previous character's value
3524 takes precedence.
3525
3526 By default, a text property is rear-sticky but not front-sticky; thus,
3527 the default is to inherit all the properties of the preceding character,
3528 and nothing from the following character.
3529
3530 You can control the stickiness of various text properties with two
3531 specific text properties, @code{front-sticky} and @code{rear-nonsticky},
3532 and with the variable @code{text-property-default-nonsticky}. You can
3533 use the variable to specify a different default for a given property.
3534 You can use those two text properties to make any specific properties
3535 sticky or nonsticky in any particular part of the text.
3536
3537 If a character's @code{front-sticky} property is @code{t}, then all
3538 its properties are front-sticky. If the @code{front-sticky} property is
3539 a list, then the sticky properties of the character are those whose
3540 names are in the list. For example, if a character has a
3541 @code{front-sticky} property whose value is @code{(face read-only)},
3542 then insertion before the character can inherit its @code{face} property
3543 and its @code{read-only} property, but no others.
3544
3545 The @code{rear-nonsticky} property works the opposite way. Most
3546 properties are rear-sticky by default, so the @code{rear-nonsticky}
3547 property says which properties are @emph{not} rear-sticky. If a
3548 character's @code{rear-nonsticky} property is @code{t}, then none of its
3549 properties are rear-sticky. If the @code{rear-nonsticky} property is a
3550 list, properties are rear-sticky @emph{unless} their names are in the
3551 list.
3552
3553 @defvar text-property-default-nonsticky
3554 This variable holds an alist which defines the default rear-stickiness
3555 of various text properties. Each element has the form
3556 @code{(@var{property} . @var{nonstickiness})}, and it defines the
3557 stickiness of a particular text property, @var{property}.
3558
3559 If @var{nonstickiness} is non-@code{nil}, this means that the property
3560 @var{property} is rear-nonsticky by default. Since all properties are
3561 front-nonsticky by default, this makes @var{property} nonsticky in both
3562 directions by default.
3563
3564 The text properties @code{front-sticky} and @code{rear-nonsticky}, when
3565 used, take precedence over the default @var{nonstickiness} specified in
3566 @code{text-property-default-nonsticky}.
3567 @end defvar
3568
3569 Here are the functions that insert text with inheritance of properties:
3570
3571 @defun insert-and-inherit &rest strings
3572 Insert the strings @var{strings}, just like the function @code{insert},
3573 but inherit any sticky properties from the adjoining text.
3574 @end defun
3575
3576 @defun insert-before-markers-and-inherit &rest strings
3577 Insert the strings @var{strings}, just like the function
3578 @code{insert-before-markers}, but inherit any sticky properties from the
3579 adjoining text.
3580 @end defun
3581
3582 @xref{Insertion}, for the ordinary insertion functions which do not
3583 inherit.
3584
3585 @node Lazy Properties
3586 @subsection Lazy Computation of Text Properties
3587
3588 Instead of computing text properties for all the text in the buffer,
3589 you can arrange to compute the text properties for parts of the text
3590 when and if something depends on them.
3591
3592 The primitive that extracts text from the buffer along with its
3593 properties is @code{buffer-substring}. Before examining the properties,
3594 this function runs the abnormal hook @code{buffer-access-fontify-functions}.
3595
3596 @defvar buffer-access-fontify-functions
3597 This variable holds a list of functions for computing text properties.
3598 Before @code{buffer-substring} copies the text and text properties for a
3599 portion of the buffer, it calls all the functions in this list. Each of
3600 the functions receives two arguments that specify the range of the
3601 buffer being accessed. (The buffer itself is always the current
3602 buffer.)
3603 @end defvar
3604
3605 The function @code{buffer-substring-no-properties} does not call these
3606 functions, since it ignores text properties anyway.
3607
3608 In order to prevent the hook functions from being called more than
3609 once for the same part of the buffer, you can use the variable
3610 @code{buffer-access-fontified-property}.
3611
3612 @defvar buffer-access-fontified-property
3613 If this variable's value is non-@code{nil}, it is a symbol which is used
3614 as a text property name. A non-@code{nil} value for that text property
3615 means, ``the other text properties for this character have already been
3616 computed''.
3617
3618 If all the characters in the range specified for @code{buffer-substring}
3619 have a non-@code{nil} value for this property, @code{buffer-substring}
3620 does not call the @code{buffer-access-fontify-functions} functions. It
3621 assumes these characters already have the right text properties, and
3622 just copies the properties they already have.
3623
3624 The normal way to use this feature is that the
3625 @code{buffer-access-fontify-functions} functions add this property, as
3626 well as others, to the characters they operate on. That way, they avoid
3627 being called over and over for the same text.
3628 @end defvar
3629
3630 @node Clickable Text
3631 @subsection Defining Clickable Text
3632 @cindex clickable text
3633 @cindex follow links
3634 @cindex mouse-1
3635
3636 @dfn{Clickable text} is text that can be clicked, with either the
3637 mouse or via a keyboard command, to produce some result. Many major
3638 modes use clickable text to implement textual hyper-links, or
3639 @dfn{links} for short.
3640
3641 The easiest way to insert and manipulate links is to use the
3642 @code{button} package. @xref{Buttons}. In this section, we will
3643 explain how to manually set up clickable text in a buffer, using text
3644 properties. For simplicity, we will refer to the clickable text as a
3645 @dfn{link}.
3646
3647 Implementing a link involves three separate steps: (1) indicating
3648 clickability when the mouse moves over the link; (2) making @key{RET}
3649 or @kbd{Mouse-2} on that link do something; and (3) setting up a
3650 @code{follow-link} condition so that the link obeys
3651 @code{mouse-1-click-follows-link}.
3652
3653 To indicate clickability, add the @code{mouse-face} text property to
3654 the text of the link; then Emacs will highlight the link when the
3655 mouse moves over it. In addition, you should define a tooltip or echo
3656 area message, using the @code{help-echo} text property. @xref{Special
3657 Properties}. For instance, here is how Dired indicates that file
3658 names are clickable:
3659
3660 @smallexample
3661 (if (dired-move-to-filename)
3662 (add-text-properties
3663 (point)
3664 (save-excursion
3665 (dired-move-to-end-of-filename)
3666 (point))
3667 '(mouse-face highlight
3668 help-echo "mouse-2: visit this file in other window")))
3669 @end smallexample
3670
3671 To make the link clickable, bind @key{RET} and @kbd{Mouse-2} to
3672 commands that perform the desired action. Each command should check
3673 to see whether it was called on a link, and act accordingly. For
3674 instance, Dired's major mode keymap binds @kbd{Mouse-2} to the
3675 following command:
3676
3677 @smallexample
3678 (defun dired-mouse-find-file-other-window (event)
3679 "In Dired, visit the file or directory name you click on."
3680 (interactive "e")
3681 (let ((window (posn-window (event-end event)))
3682 (pos (posn-point (event-end event)))
3683 file)
3684 (if (not (windowp window))
3685 (error "No file chosen"))
3686 (with-current-buffer (window-buffer window)
3687 (goto-char pos)
3688 (setq file (dired-get-file-for-visit)))
3689 (if (file-directory-p file)
3690 (or (and (cdr dired-subdir-alist)
3691 (dired-goto-subdir file))
3692 (progn
3693 (select-window window)
3694 (dired-other-window file)))
3695 (select-window window)
3696 (find-file-other-window (file-name-sans-versions file t)))))
3697 @end smallexample
3698
3699 @noindent
3700 This command uses the functions @code{posn-window} and
3701 @code{posn-point} to determine where the click occurred, and
3702 @code{dired-get-file-for-visit} to determine which file to visit.
3703
3704 Instead of binding the mouse command in a major mode keymap, you can
3705 bind it within the link text, using the @code{keymap} text property
3706 (@pxref{Special Properties}). For instance:
3707
3708 @example
3709 (let ((map (make-sparse-keymap)))
3710 (define-key map [mouse-2] 'operate-this-button)
3711 (put-text-property link-start link-end 'keymap map))
3712 @end example
3713
3714 @noindent
3715 With this method, you can easily define different commands for
3716 different links. Furthermore, the global definition of @key{RET} and
3717 @kbd{Mouse-2} remain available for the rest of the text in the buffer.
3718
3719 @vindex mouse-1-click-follows-link
3720 The basic Emacs command for clicking on links is @kbd{Mouse-2}.
3721 However, for compatibility with other graphical applications, Emacs
3722 also recognizes @kbd{Mouse-1} clicks on links, provided the user
3723 clicks on the link quickly without moving the mouse. This behavior is
3724 controlled by the user option @code{mouse-1-click-follows-link}.
3725 @xref{Mouse References,,, emacs, The GNU Emacs Manual}.
3726
3727 To set up the link so that it obeys
3728 @code{mouse-1-click-follows-link}, you must either (1) apply a
3729 @code{follow-link} text or overlay property to the link text, or (2)
3730 bind the @code{follow-link} event to a keymap (which can be a major
3731 mode keymap or a local keymap specified via the @code{keymap} text
3732 property). The value of the @code{follow-link} property, or the
3733 binding for the @code{follow-link} event, acts as a ``condition'' for
3734 the link action. This condition tells Emacs two things: the
3735 circumstances under which a @kbd{Mouse-1} click should be regarded as
3736 occurring ``inside'' the link, and how to compute an ``action code''
3737 that says what to translate the @kbd{Mouse-1} click into. The link
3738 action condition can be one of the following:
3739
3740 @table @asis
3741 @item @code{mouse-face}
3742 If the condition is the symbol @code{mouse-face}, a position is inside
3743 a link if there is a non-@code{nil} @code{mouse-face} property at that
3744 position. The action code is always @code{t}.
3745
3746 For example, here is how Info mode handles @key{Mouse-1}:
3747
3748 @smallexample
3749 (define-key Info-mode-map [follow-link] 'mouse-face)
3750 @end smallexample
3751
3752 @item a function
3753 If the condition is a function, @var{func}, then a position @var{pos}
3754 is inside a link if @code{(@var{func} @var{pos})} evaluates to
3755 non-@code{nil}. The value returned by @var{func} serves as the action
3756 code.
3757
3758 For example, here is how pcvs enables @kbd{Mouse-1} to follow links on
3759 file names only:
3760
3761 @smallexample
3762 (define-key map [follow-link]
3763 (lambda (pos)
3764 (eq (get-char-property pos 'face) 'cvs-filename-face)))
3765 @end smallexample
3766
3767 @item anything else
3768 If the condition value is anything else, then the position is inside a
3769 link and the condition itself is the action code. Clearly, you should
3770 specify this kind of condition only when applying the condition via a
3771 text or property overlay on the link text (so that it does not apply
3772 to the entire buffer).
3773 @end table
3774
3775 @noindent
3776 The action code tells @kbd{Mouse-1} how to follow the link:
3777
3778 @table @asis
3779 @item a string or vector
3780 If the action code is a string or vector, the @kbd{Mouse-1} event is
3781 translated into the first element of the string or vector; i.e., the
3782 action of the @kbd{Mouse-1} click is the local or global binding of
3783 that character or symbol. Thus, if the action code is @code{"foo"},
3784 @kbd{Mouse-1} translates into @kbd{f}. If it is @code{[foo]},
3785 @kbd{Mouse-1} translates into @key{foo}.
3786
3787 @item anything else
3788 For any other non-@code{nil} action code, the @kbd{Mouse-1} event is
3789 translated into a @kbd{Mouse-2} event at the same position.
3790 @end table
3791
3792 To define @kbd{Mouse-1} to activate a button defined with
3793 @code{define-button-type}, give the button a @code{follow-link}
3794 property. The property value should be a link action condition, as
3795 described above. @xref{Buttons}. For example, here is how Help mode
3796 handles @kbd{Mouse-1}:
3797
3798 @smallexample
3799 (define-button-type 'help-xref
3800 'follow-link t
3801 'action #'help-button-action)
3802 @end smallexample
3803
3804 To define @kbd{Mouse-1} on a widget defined with
3805 @code{define-widget}, give the widget a @code{:follow-link} property.
3806 The property value should be a link action condition, as described
3807 above. For example, here is how the @code{link} widget specifies that
3808 a @key{Mouse-1} click shall be translated to @key{RET}:
3809
3810 @smallexample
3811 (define-widget 'link 'item
3812 "An embedded link."
3813 :button-prefix 'widget-link-prefix
3814 :button-suffix 'widget-link-suffix
3815 :follow-link "\C-m"
3816 :help-echo "Follow the link."
3817 :format "%[%t%]")
3818 @end smallexample
3819
3820 @defun mouse-on-link-p pos
3821 This function returns non-@code{nil} if position @var{pos} in the
3822 current buffer is on a link. @var{pos} can also be a mouse event
3823 location, as returned by @code{event-start} (@pxref{Accessing Mouse}).
3824 @end defun
3825
3826 @node Fields
3827 @subsection Defining and Using Fields
3828 @cindex fields
3829
3830 A field is a range of consecutive characters in the buffer that are
3831 identified by having the same value (comparing with @code{eq}) of the
3832 @code{field} property (either a text-property or an overlay property).
3833 This section describes special functions that are available for
3834 operating on fields.
3835
3836 You specify a field with a buffer position, @var{pos}. We think of
3837 each field as containing a range of buffer positions, so the position
3838 you specify stands for the field containing that position.
3839
3840 When the characters before and after @var{pos} are part of the same
3841 field, there is no doubt which field contains @var{pos}: the one those
3842 characters both belong to. When @var{pos} is at a boundary between
3843 fields, which field it belongs to depends on the stickiness of the
3844 @code{field} properties of the two surrounding characters (@pxref{Sticky
3845 Properties}). The field whose property would be inherited by text
3846 inserted at @var{pos} is the field that contains @var{pos}.
3847
3848 There is an anomalous case where newly inserted text at @var{pos}
3849 would not inherit the @code{field} property from either side. This
3850 happens if the previous character's @code{field} property is not
3851 rear-sticky, and the following character's @code{field} property is not
3852 front-sticky. In this case, @var{pos} belongs to neither the preceding
3853 field nor the following field; the field functions treat it as belonging
3854 to an empty field whose beginning and end are both at @var{pos}.
3855
3856 In all of these functions, if @var{pos} is omitted or @code{nil}, the
3857 value of point is used by default. If narrowing is in effect, then
3858 @var{pos} should fall within the accessible portion. @xref{Narrowing}.
3859
3860 @defun field-beginning &optional pos escape-from-edge limit
3861 This function returns the beginning of the field specified by @var{pos}.
3862
3863 If @var{pos} is at the beginning of its field, and
3864 @var{escape-from-edge} is non-@code{nil}, then the return value is
3865 always the beginning of the preceding field that @emph{ends} at @var{pos},
3866 regardless of the stickiness of the @code{field} properties around
3867 @var{pos}.
3868
3869 If @var{limit} is non-@code{nil}, it is a buffer position; if the
3870 beginning of the field is before @var{limit}, then @var{limit} will be
3871 returned instead.
3872 @end defun
3873
3874 @defun field-end &optional pos escape-from-edge limit
3875 This function returns the end of the field specified by @var{pos}.
3876
3877 If @var{pos} is at the end of its field, and @var{escape-from-edge} is
3878 non-@code{nil}, then the return value is always the end of the following
3879 field that @emph{begins} at @var{pos}, regardless of the stickiness of
3880 the @code{field} properties around @var{pos}.
3881
3882 If @var{limit} is non-@code{nil}, it is a buffer position; if the end
3883 of the field is after @var{limit}, then @var{limit} will be returned
3884 instead.
3885 @end defun
3886
3887 @defun field-string &optional pos
3888 This function returns the contents of the field specified by @var{pos},
3889 as a string.
3890 @end defun
3891
3892 @defun field-string-no-properties &optional pos
3893 This function returns the contents of the field specified by @var{pos},
3894 as a string, discarding text properties.
3895 @end defun
3896
3897 @defun delete-field &optional pos
3898 This function deletes the text of the field specified by @var{pos}.
3899 @end defun
3900
3901 @defun constrain-to-field new-pos old-pos &optional escape-from-edge only-in-line inhibit-capture-property
3902 This function ``constrains'' @var{new-pos} to the field that
3903 @var{old-pos} belongs to---in other words, it returns the position
3904 closest to @var{new-pos} that is in the same field as @var{old-pos}.
3905
3906 If @var{new-pos} is @code{nil}, then @code{constrain-to-field} uses
3907 the value of point instead, and moves point to the resulting position
3908 in addition to returning that position.
3909
3910 If @var{old-pos} is at the boundary of two fields, then the acceptable
3911 final positions depend on the argument @var{escape-from-edge}. If
3912 @var{escape-from-edge} is @code{nil}, then @var{new-pos} must be in
3913 the field whose @code{field} property equals what new characters
3914 inserted at @var{old-pos} would inherit. (This depends on the
3915 stickiness of the @code{field} property for the characters before and
3916 after @var{old-pos}.) If @var{escape-from-edge} is non-@code{nil},
3917 @var{new-pos} can be anywhere in the two adjacent fields.
3918 Additionally, if two fields are separated by another field with the
3919 special value @code{boundary}, then any point within this special
3920 field is also considered to be ``on the boundary''.
3921
3922 Commands like @kbd{C-a} with no argument, that normally move backward
3923 to a specific kind of location and stay there once there, probably
3924 should specify @code{nil} for @var{escape-from-edge}. Other motion
3925 commands that check fields should probably pass @code{t}.
3926
3927 If the optional argument @var{only-in-line} is non-@code{nil}, and
3928 constraining @var{new-pos} in the usual way would move it to a different
3929 line, @var{new-pos} is returned unconstrained. This used in commands
3930 that move by line, such as @code{next-line} and
3931 @code{beginning-of-line}, so that they respect field boundaries only in
3932 the case where they can still move to the right line.
3933
3934 If the optional argument @var{inhibit-capture-property} is
3935 non-@code{nil}, and @var{old-pos} has a non-@code{nil} property of that
3936 name, then any field boundaries are ignored.
3937
3938 You can cause @code{constrain-to-field} to ignore all field boundaries
3939 (and so never constrain anything) by binding the variable
3940 @code{inhibit-field-text-motion} to a non-@code{nil} value.
3941 @end defun
3942
3943 @node Not Intervals
3944 @subsection Why Text Properties are not Intervals
3945 @cindex intervals
3946
3947 Some editors that support adding attributes to text in the buffer do
3948 so by letting the user specify ``intervals'' within the text, and adding
3949 the properties to the intervals. Those editors permit the user or the
3950 programmer to determine where individual intervals start and end. We
3951 deliberately provided a different sort of interface in Emacs Lisp to
3952 avoid certain paradoxical behavior associated with text modification.
3953
3954 If the actual subdivision into intervals is meaningful, that means you
3955 can distinguish between a buffer that is just one interval with a
3956 certain property, and a buffer containing the same text subdivided into
3957 two intervals, both of which have that property.
3958
3959 Suppose you take the buffer with just one interval and kill part of
3960 the text. The text remaining in the buffer is one interval, and the
3961 copy in the kill ring (and the undo list) becomes a separate interval.
3962 Then if you yank back the killed text, you get two intervals with the
3963 same properties. Thus, editing does not preserve the distinction
3964 between one interval and two.
3965
3966 Suppose we ``fix'' this problem by coalescing the two intervals when
3967 the text is inserted. That works fine if the buffer originally was a
3968 single interval. But suppose instead that we have two adjacent
3969 intervals with the same properties, and we kill the text of one interval
3970 and yank it back. The same interval-coalescence feature that rescues
3971 the other case causes trouble in this one: after yanking, we have just
3972 one interval. Once again, editing does not preserve the distinction
3973 between one interval and two.
3974
3975 Insertion of text at the border between intervals also raises
3976 questions that have no satisfactory answer.
3977
3978 However, it is easy to arrange for editing to behave consistently
3979 for questions of the form, ``What are the properties of text at this
3980 buffer or string position?'' So we have decided these are the only
3981 questions that make sense; we have not implemented asking questions
3982 about where intervals start or end.
3983
3984 In practice, you can usually use the text property search functions in
3985 place of explicit interval boundaries. You can think of them as finding
3986 the boundaries of intervals, assuming that intervals are always
3987 coalesced whenever possible. @xref{Property Search}.
3988
3989 Emacs also provides explicit intervals as a presentation feature; see
3990 @ref{Overlays}.
3991
3992 @node Substitution
3993 @section Substituting for a Character Code
3994
3995 The following functions replace characters within a specified region
3996 based on their character codes.
3997
3998 @defun subst-char-in-region start end old-char new-char &optional noundo
3999 @cindex replace characters
4000 This function replaces all occurrences of the character @var{old-char}
4001 with the character @var{new-char} in the region of the current buffer
4002 defined by @var{start} and @var{end}.
4003
4004 @cindex undo avoidance
4005 If @var{noundo} is non-@code{nil}, then @code{subst-char-in-region} does
4006 not record the change for undo and does not mark the buffer as modified.
4007 This was useful for controlling the old selective display feature
4008 (@pxref{Selective Display}).
4009
4010 @code{subst-char-in-region} does not move point and returns
4011 @code{nil}.
4012
4013 @example
4014 @group
4015 ---------- Buffer: foo ----------
4016 This is the contents of the buffer before.
4017 ---------- Buffer: foo ----------
4018 @end group
4019
4020 @group
4021 (subst-char-in-region 1 20 ?i ?X)
4022 @result{} nil
4023
4024 ---------- Buffer: foo ----------
4025 ThXs Xs the contents of the buffer before.
4026 ---------- Buffer: foo ----------
4027 @end group
4028 @end example
4029 @end defun
4030
4031 @deffn Command translate-region start end table
4032 This function applies a translation table to the characters in the
4033 buffer between positions @var{start} and @var{end}.
4034
4035 The translation table @var{table} is a string or a char-table;
4036 @code{(aref @var{table} @var{ochar})} gives the translated character
4037 corresponding to @var{ochar}. If @var{table} is a string, any
4038 characters with codes larger than the length of @var{table} are not
4039 altered by the translation.
4040
4041 The return value of @code{translate-region} is the number of
4042 characters that were actually changed by the translation. This does
4043 not count characters that were mapped into themselves in the
4044 translation table.
4045 @end deffn
4046
4047 @node Registers
4048 @section Registers
4049 @cindex registers
4050
4051 A register is a sort of variable used in Emacs editing that can hold a
4052 variety of different kinds of values. Each register is named by a
4053 single character. All @acronym{ASCII} characters and their meta variants
4054 (but with the exception of @kbd{C-g}) can be used to name registers.
4055 Thus, there are 255 possible registers. A register is designated in
4056 Emacs Lisp by the character that is its name.
4057
4058 @defvar register-alist
4059 This variable is an alist of elements of the form @code{(@var{name} .
4060 @var{contents})}. Normally, there is one element for each Emacs
4061 register that has been used.
4062
4063 The object @var{name} is a character (an integer) identifying the
4064 register.
4065 @end defvar
4066
4067 The @var{contents} of a register can have several possible types:
4068
4069 @table @asis
4070 @item a number
4071 A number stands for itself. If @code{insert-register} finds a number
4072 in the register, it converts the number to decimal.
4073
4074 @item a marker
4075 A marker represents a buffer position to jump to.
4076
4077 @item a string
4078 A string is text saved in the register.
4079
4080 @item a rectangle
4081 A rectangle is represented by a list of strings.
4082
4083 @item @code{(@var{window-configuration} @var{position})}
4084 This represents a window configuration to restore in one frame, and a
4085 position to jump to in the current buffer.
4086
4087 @c FIXME: Mention frameset here.
4088 @item @code{(@var{frame-configuration} @var{position})}
4089 This represents a frame configuration to restore, and a position
4090 to jump to in the current buffer.
4091
4092 @item (file @var{filename})
4093 This represents a file to visit; jumping to this value visits file
4094 @var{filename}.
4095
4096 @item (file-query @var{filename} @var{position})
4097 This represents a file to visit and a position in it; jumping to this
4098 value visits file @var{filename} and goes to buffer position
4099 @var{position}. Restoring this type of position asks the user for
4100 confirmation first.
4101 @end table
4102
4103 The functions in this section return unpredictable values unless
4104 otherwise stated.
4105
4106 @defun get-register reg
4107 This function returns the contents of the register
4108 @var{reg}, or @code{nil} if it has no contents.
4109 @end defun
4110
4111 @defun set-register reg value
4112 This function sets the contents of register @var{reg} to @var{value}.
4113 A register can be set to any value, but the other register functions
4114 expect only certain data types. The return value is @var{value}.
4115 @end defun
4116
4117 @deffn Command view-register reg
4118 This command displays what is contained in register @var{reg}.
4119 @end deffn
4120
4121 @deffn Command insert-register reg &optional beforep
4122 This command inserts contents of register @var{reg} into the current
4123 buffer.
4124
4125 Normally, this command puts point before the inserted text, and the
4126 mark after it. However, if the optional second argument @var{beforep}
4127 is non-@code{nil}, it puts the mark before and point after.
4128
4129 When called interactively, the command defaults to putting point after
4130 text, and a prefix argument inverts this behavior.
4131
4132 If the register contains a rectangle, then the rectangle is inserted
4133 with its upper left corner at point. This means that text is inserted
4134 in the current line and underneath it on successive lines.
4135
4136 If the register contains something other than saved text (a string) or
4137 a rectangle (a list), currently useless things happen. This may be
4138 changed in the future.
4139 @end deffn
4140
4141 @defun register-read-with-preview prompt
4142 @cindex register preview
4143 This function reads and returns a register name, prompting with
4144 @var{prompt} and possibly showing a preview of the existing registers
4145 and their contents. The preview is shown in a temporary window, after
4146 the delay specified by the user option @code{register-preview-delay},
4147 if its value and @code{register-alist} are both non-@code{nil}. The
4148 preview is also shown if the user requests help (e.g., by typing the
4149 help character). We recommend that all interactive commands which
4150 read register names use this function.
4151 @end defun
4152
4153 @node Transposition
4154 @section Transposition of Text
4155
4156 This function can be used to transpose stretches of text:
4157
4158 @defun transpose-regions start1 end1 start2 end2 &optional leave-markers
4159 This function exchanges two nonoverlapping portions of the buffer.
4160 Arguments @var{start1} and @var{end1} specify the bounds of one portion
4161 and arguments @var{start2} and @var{end2} specify the bounds of the
4162 other portion.
4163
4164 Normally, @code{transpose-regions} relocates markers with the transposed
4165 text; a marker previously positioned within one of the two transposed
4166 portions moves along with that portion, thus remaining between the same
4167 two characters in their new position. However, if @var{leave-markers}
4168 is non-@code{nil}, @code{transpose-regions} does not do this---it leaves
4169 all markers unrelocated.
4170 @end defun
4171
4172 @node Decompression
4173 @section Dealing With Compressed Data
4174
4175 When @code{auto-compression-mode} is enabled, Emacs automatically
4176 uncompresses compressed files when you visit them, and automatically
4177 recompresses them if you alter and save them. @xref{Compressed
4178 Files,,, emacs, The GNU Emacs Manual}.
4179
4180 The above feature works by calling an external executable (e.g.,
4181 @command{gzip}). Emacs can also be compiled with support for built-in
4182 decompression using the zlib library, which is faster than calling an
4183 external program.
4184
4185 @defun zlib-available-p
4186 This function returns non-@code{nil} if built-in zlib decompression is
4187 available.
4188 @end defun
4189
4190 @defun zlib-decompress-region start end
4191 This function decompresses the region between @var{start} and
4192 @var{end}, using built-in zlib decompression. The region should
4193 contain data that were compressed with gzip or zlib. On success, the
4194 function replaces the contents of the region with the decompressed
4195 data. On failure, the function leaves the region unchanged and
4196 returns @code{nil}. This function can be called only in unibyte
4197 buffers.
4198 @end defun
4199
4200
4201 @node Base 64
4202 @section Base 64 Encoding
4203 @cindex base 64 encoding
4204
4205 Base 64 code is used in email to encode a sequence of 8-bit bytes as
4206 a longer sequence of @acronym{ASCII} graphic characters. It is defined in
4207 Internet RFC@footnote{
4208 An RFC, an acronym for @dfn{Request for Comments}, is a numbered
4209 Internet informational document describing a standard. RFCs are
4210 usually written by technical experts acting on their own initiative,
4211 and are traditionally written in a pragmatic, experience-driven
4212 manner.
4213 }2045. This section describes the functions for
4214 converting to and from this code.
4215
4216 @deffn Command base64-encode-region beg end &optional no-line-break
4217 This function converts the region from @var{beg} to @var{end} into base
4218 64 code. It returns the length of the encoded text. An error is
4219 signaled if a character in the region is multibyte, i.e., in a
4220 multibyte buffer the region must contain only characters from the
4221 charsets @code{ascii}, @code{eight-bit-control} and
4222 @code{eight-bit-graphic}.
4223
4224 Normally, this function inserts newline characters into the encoded
4225 text, to avoid overlong lines. However, if the optional argument
4226 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4227 the output is just one long line.
4228 @end deffn
4229
4230 @defun base64-encode-string string &optional no-line-break
4231 This function converts the string @var{string} into base 64 code. It
4232 returns a string containing the encoded text. As for
4233 @code{base64-encode-region}, an error is signaled if a character in the
4234 string is multibyte.
4235
4236 Normally, this function inserts newline characters into the encoded
4237 text, to avoid overlong lines. However, if the optional argument
4238 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4239 the result string is just one long line.
4240 @end defun
4241
4242 @deffn Command base64-decode-region beg end
4243 This function converts the region from @var{beg} to @var{end} from base
4244 64 code into the corresponding decoded text. It returns the length of
4245 the decoded text.
4246
4247 The decoding functions ignore newline characters in the encoded text.
4248 @end deffn
4249
4250 @defun base64-decode-string string
4251 This function converts the string @var{string} from base 64 code into
4252 the corresponding decoded text. It returns a unibyte string containing the
4253 decoded text.
4254
4255 The decoding functions ignore newline characters in the encoded text.
4256 @end defun
4257
4258 @node Checksum/Hash
4259 @section Checksum/Hash
4260 @cindex MD5 checksum
4261 @cindex SHA hash
4262 @cindex hash, cryptographic
4263 @cindex cryptographic hash
4264
4265 Emacs has built-in support for computing @dfn{cryptographic hashes}.
4266 A cryptographic hash, or @dfn{checksum}, is a digital ``fingerprint''
4267 of a piece of data (e.g., a block of text) which can be used to check
4268 that you have an unaltered copy of that data.
4269
4270 @cindex message digest
4271 Emacs supports several common cryptographic hash algorithms: MD5,
4272 SHA-1, SHA-2, SHA-224, SHA-256, SHA-384 and SHA-512. MD5 is the
4273 oldest of these algorithms, and is commonly used in @dfn{message
4274 digests} to check the integrity of messages transmitted over a
4275 network. MD5 is not ``collision resistant'' (i.e., it is possible to
4276 deliberately design different pieces of data which have the same MD5
4277 hash), so you should not used it for anything security-related. A
4278 similar theoretical weakness also exists in SHA-1. Therefore, for
4279 security-related applications you should use the other hash types,
4280 such as SHA-2.
4281
4282 @defun secure-hash algorithm object &optional start end binary
4283 This function returns a hash for @var{object}. The argument
4284 @var{algorithm} is a symbol stating which hash to compute: one of
4285 @code{md5}, @code{sha1}, @code{sha224}, @code{sha256}, @code{sha384}
4286 or @code{sha512}. The argument @var{object} should be a buffer or a
4287 string.
4288
4289 The optional arguments @var{start} and @var{end} are character
4290 positions specifying the portion of @var{object} to compute the
4291 message digest for. If they are @code{nil} or omitted, the hash is
4292 computed for the whole of @var{object}.
4293
4294 If the argument @var{binary} is omitted or @code{nil}, the function
4295 returns the @dfn{text form} of the hash, as an ordinary Lisp string.
4296 If @var{binary} is non-@code{nil}, it returns the hash in @dfn{binary
4297 form}, as a sequence of bytes stored in a unibyte string.
4298
4299 This function does not compute the hash directly from the internal
4300 representation of @var{object}'s text (@pxref{Text Representations}).
4301 Instead, it encodes the text using a coding system (@pxref{Coding
4302 Systems}), and computes the hash from that encoded text. If
4303 @var{object} is a buffer, the coding system used is the one which
4304 would be chosen by default for writing the text into a file. If
4305 @var{object} is a string, the user's preferred coding system is used
4306 (@pxref{Recognize Coding,,, emacs, GNU Emacs Manual}).
4307 @end defun
4308
4309 @defun md5 object &optional start end coding-system noerror
4310 This function returns an MD5 hash. It is semi-obsolete, since for
4311 most purposes it is equivalent to calling @code{secure-hash} with
4312 @code{md5} as the @var{algorithm} argument. The @var{object},
4313 @var{start} and @var{end} arguments have the same meanings as in
4314 @code{secure-hash}.
4315
4316 If @var{coding-system} is non-@code{nil}, it specifies a coding system
4317 to use to encode the text; if omitted or @code{nil}, the default
4318 coding system is used, like in @code{secure-hash}.
4319
4320 Normally, @code{md5} signals an error if the text can't be encoded
4321 using the specified or chosen coding system. However, if
4322 @var{noerror} is non-@code{nil}, it silently uses @code{raw-text}
4323 coding instead.
4324 @end defun
4325
4326 @node Parsing HTML/XML
4327 @section Parsing HTML and XML
4328 @cindex parsing html
4329
4330 When Emacs is compiled with libxml2 support, the following functions
4331 are available to parse HTML or XML text into Lisp object trees.
4332
4333 @defun libxml-parse-html-region start end &optional base-url discard-comments
4334 This function parses the text between @var{start} and @var{end} as
4335 HTML, and returns a list representing the HTML @dfn{parse tree}. It
4336 attempts to handle ``real world'' HTML by robustly coping with syntax
4337 mistakes.
4338
4339 The optional argument @var{base-url}, if non-@code{nil}, should be a
4340 string specifying the base URL for relative URLs occurring in links.
4341
4342 If the optional argument @var{discard-comments} is non-@code{nil},
4343 then the parse tree is created without any comments.
4344
4345 In the parse tree, each HTML node is represented by a list in which
4346 the first element is a symbol representing the node name, the second
4347 element is an alist of node attributes, and the remaining elements are
4348 the subnodes.
4349
4350 The following example demonstrates this. Given this (malformed) HTML
4351 document:
4352
4353 @example
4354 <html><head></head><body width=101><div class=thing>Foo<div>Yes
4355 @end example
4356
4357 @noindent
4358 A call to @code{libxml-parse-html-region} returns this @acronym{DOM}
4359 (document object model):
4360
4361 @example
4362 (html nil
4363 (head nil)
4364 (body ((width . "101"))
4365 (div ((class . "thing"))
4366 "Foo"
4367 (div nil
4368 "Yes"))))
4369 @end example
4370 @end defun
4371
4372 @cindex rendering html
4373 @defun shr-insert-document dom
4374 This function renders the parsed HTML in @var{dom} into the current
4375 buffer. The argument @var{dom} should be a list as generated by
4376 @code{libxml-parse-html-region}. This function is, e.g., used by
4377 @ref{Top, EWW,, eww, The Emacs Web Wowser Manual}.
4378 @end defun
4379
4380 @cindex parsing xml
4381 @defun libxml-parse-xml-region start end &optional base-url discard-comments
4382 This function is the same as @code{libxml-parse-html-region}, except
4383 that it parses the text as XML rather than HTML (so it is stricter
4384 about syntax).
4385 @end defun
4386
4387 @menu
4388 * Document Object Model:: Access, manipulate and search the @acronym{DOM}.
4389 @end menu
4390
4391 @node Document Object Model
4392 @subsection Document Object Model
4393 @cindex HTML DOM
4394 @cindex XML DOM
4395 @cindex DOM
4396 @cindex Document Object Model
4397
4398 The @acronym{DOM} returned by @code{libxml-parse-html-region} (and the
4399 other @acronym{XML} parsing functions) is a tree structure where each
4400 node has a node name (called a @dfn{tag}), and optional key/value
4401 @dfn{attribute} list, and then a list of @dfn{child nodes}. The child
4402 nodes are either strings or @acronym{DOM} objects.
4403
4404 @example
4405 (body ((width . "101"))
4406 (div ((class . "thing"))
4407 "Foo"
4408 (div nil
4409 "Yes")))
4410 @end example
4411
4412 @defun dom-node tag &optional attributes &rest children
4413 This function creates a @acronym{DOM} node of type @var{tag}. If
4414 given, @var{attributes} should be a key/value pair list.
4415 If given, @var{children} should be @acronym{DOM} nodes.
4416 @end defun
4417
4418 The following functions can be used to work with this structure. Each
4419 function takes a @acronym{DOM} node, or a list of nodes. In the
4420 latter case, only the first node in the list is used.
4421
4422 Simple accessors:
4423
4424 @table @code
4425 @item dom-tag @var{node}
4426 Return the @dfn{tag} (also called ``node name'') of the node.
4427
4428 @item dom-attr @var{node} @var{attribute}
4429 Return the value of @var{attribute} in the node. A common usage
4430 would be:
4431
4432 @lisp
4433 (dom-attr img 'href)
4434 => "http://fsf.org/logo.png"
4435 @end lisp
4436
4437 @item dom-children @var{node}
4438 Return all the children of the node.
4439
4440 @item dom-non-text-children @var{node}
4441 Return all the non-string children of the node.
4442
4443 @item dom-attributes @var{node}
4444 Return the key/value pair list of attributes of the node.
4445
4446 @item dom-text @var{node}
4447 Return all the textual elements of the node as a concatenated string.
4448
4449 @item dom-texts @var{node}
4450 Return all the textual elements of the node, as well as the textual
4451 elements of all the children of the node, recursively, as a
4452 concatenated string. This function also takes an optional separator
4453 to be inserted between the textual elements.
4454
4455 @item dom-parent @var{dom} @var{node}
4456 Return the parent of @var{node} in @var{dom}.
4457 @end table
4458
4459 The following are functions for altering the @acronym{DOM}.
4460
4461 @table @code
4462 @item dom-set-attribute @var{node} @var{attribute} @var{value}
4463 Set the @var{attribute} of the node to @var{value}.
4464
4465 @item dom-append-child @var{node} @var{child}
4466 Append @var{child} as the last child of @var{node}.
4467
4468 @item dom-add-child-before @var{node} @var{child} @var{before}
4469 Add @var{child} to @var{node}'s child list before the @var{before}
4470 node. If @var{before} is nil, make @var{child} the first child.
4471
4472 @item dom-set-attributes @var{node} @var{attributes}
4473 Replace all the attributes of the node with a new key/value list.
4474 @end table
4475
4476 The following are functions for searching for elements in the
4477 @acronym{DOM}. They all return lists of matching nodes.
4478
4479 @table @code
4480 @item dom-by-tag @var{dom} @var{tag}
4481 Return all nodes in @var{dom} that are of type @var{tag}. A typical
4482 use would be:
4483
4484 @lisp
4485 (dom-by-tag dom 'td)
4486 => '((td ...) (td ...) (td ...))
4487 @end lisp
4488
4489 @item dom-by-class @var{dom} @var{match}
4490 Return all nodes in @var{dom} that have class names that match
4491 @var{match}, which is a regular expression.
4492
4493 @item dom-by-style @var{dom} @var{style}
4494 Return all nodes in @var{dom} that have styles that match @var{match},
4495 which is a regular expression.
4496
4497 @item dom-by-id @var{dom} @var{style}
4498 Return all nodes in @var{dom} that have IDs that match @var{match},
4499 which is a regular expression.
4500
4501 @end table
4502
4503 Utility functions:
4504
4505 @table @code
4506 @item dom-pp @var{dom} &optional @var{remove-empty}
4507 Pretty-print @var{dom} at point. If @var{remove-empty}, don't print
4508 textual nodes that just contain white-space.
4509 @end table
4510
4511
4512 @node Atomic Changes
4513 @section Atomic Change Groups
4514 @cindex atomic changes
4515
4516 In database terminology, an @dfn{atomic} change is an indivisible
4517 change---it can succeed entirely or it can fail entirely, but it
4518 cannot partly succeed. A Lisp program can make a series of changes to
4519 one or several buffers as an @dfn{atomic change group}, meaning that
4520 either the entire series of changes will be installed in their buffers
4521 or, in case of an error, none of them will be.
4522
4523 To do this for one buffer, the one already current, simply write a
4524 call to @code{atomic-change-group} around the code that makes the
4525 changes, like this:
4526
4527 @example
4528 (atomic-change-group
4529 (insert foo)
4530 (delete-region x y))
4531 @end example
4532
4533 @noindent
4534 If an error (or other nonlocal exit) occurs inside the body of
4535 @code{atomic-change-group}, it unmakes all the changes in that buffer
4536 that were during the execution of the body. This kind of change group
4537 has no effect on any other buffers---any such changes remain.
4538
4539 If you need something more sophisticated, such as to make changes in
4540 various buffers constitute one atomic group, you must directly call
4541 lower-level functions that @code{atomic-change-group} uses.
4542
4543 @defun prepare-change-group &optional buffer
4544 This function sets up a change group for buffer @var{buffer}, which
4545 defaults to the current buffer. It returns a ``handle'' that
4546 represents the change group. You must use this handle to activate the
4547 change group and subsequently to finish it.
4548 @end defun
4549
4550 To use the change group, you must @dfn{activate} it. You must do
4551 this before making any changes in the text of @var{buffer}.
4552
4553 @defun activate-change-group handle
4554 This function activates the change group that @var{handle} designates.
4555 @end defun
4556
4557 After you activate the change group, any changes you make in that
4558 buffer become part of it. Once you have made all the desired changes
4559 in the buffer, you must @dfn{finish} the change group. There are two
4560 ways to do this: you can either accept (and finalize) all the changes,
4561 or cancel them all.
4562
4563 @defun accept-change-group handle
4564 This function accepts all the changes in the change group specified by
4565 @var{handle}, making them final.
4566 @end defun
4567
4568 @defun cancel-change-group handle
4569 This function cancels and undoes all the changes in the change group
4570 specified by @var{handle}.
4571 @end defun
4572
4573 Your code should use @code{unwind-protect} to make sure the group is
4574 always finished. The call to @code{activate-change-group} should be
4575 inside the @code{unwind-protect}, in case the user types @kbd{C-g}
4576 just after it runs. (This is one reason why
4577 @code{prepare-change-group} and @code{activate-change-group} are
4578 separate functions, because normally you would call
4579 @code{prepare-change-group} before the start of that
4580 @code{unwind-protect}.) Once you finish the group, don't use the
4581 handle again---in particular, don't try to finish the same group
4582 twice.
4583
4584 To make a multibuffer change group, call @code{prepare-change-group}
4585 once for each buffer you want to cover, then use @code{nconc} to
4586 combine the returned values, like this:
4587
4588 @example
4589 (nconc (prepare-change-group buffer-1)
4590 (prepare-change-group buffer-2))
4591 @end example
4592
4593 You can then activate the multibuffer change group with a single call
4594 to @code{activate-change-group}, and finish it with a single call to
4595 @code{accept-change-group} or @code{cancel-change-group}.
4596
4597 Nested use of several change groups for the same buffer works as you
4598 would expect. Non-nested use of change groups for the same buffer
4599 will get Emacs confused, so don't let it happen; the first change
4600 group you start for any given buffer should be the last one finished.
4601
4602 @node Change Hooks
4603 @section Change Hooks
4604 @cindex change hooks
4605 @cindex hooks for text changes
4606
4607 These hook variables let you arrange to take notice of all changes in
4608 all buffers (or in a particular buffer, if you make them buffer-local).
4609 See also @ref{Special Properties}, for how to detect changes to specific
4610 parts of the text.
4611
4612 The functions you use in these hooks should save and restore the match
4613 data if they do anything that uses regular expressions; otherwise, they
4614 will interfere in bizarre ways with the editing operations that call
4615 them.
4616
4617 @defvar before-change-functions
4618 This variable holds a list of functions to call before any buffer
4619 modification. Each function gets two arguments, the beginning and end
4620 of the region that is about to change, represented as integers. The
4621 buffer that is about to change is always the current buffer.
4622 @end defvar
4623
4624 @defvar after-change-functions
4625 This variable holds a list of functions to call after any buffer
4626 modification. Each function receives three arguments: the beginning
4627 and end of the region just changed, and the length of the text that
4628 existed before the change. All three arguments are integers. The
4629 buffer that has been changed is always the current buffer.
4630
4631 The length of the old text is the difference between the buffer
4632 positions before and after that text as it was before the change. As
4633 for the changed text, its length is simply the difference between the
4634 first two arguments.
4635 @end defvar
4636
4637 Output of messages into the @file{*Messages*} buffer does not
4638 call these functions.
4639
4640 @defmac combine-after-change-calls body@dots{}
4641 The macro executes @var{body} normally, but arranges to call the
4642 after-change functions just once for a series of several changes---if
4643 that seems safe.
4644
4645 If a program makes several text changes in the same area of the buffer,
4646 using the macro @code{combine-after-change-calls} around that part of
4647 the program can make it run considerably faster when after-change hooks
4648 are in use. When the after-change hooks are ultimately called, the
4649 arguments specify a portion of the buffer including all of the changes
4650 made within the @code{combine-after-change-calls} body.
4651
4652 @strong{Warning:} You must not alter the values of
4653 @code{after-change-functions} within
4654 the body of a @code{combine-after-change-calls} form.
4655
4656 @strong{Warning:} if the changes you combine occur in widely scattered
4657 parts of the buffer, this will still work, but it is not advisable,
4658 because it may lead to inefficient behavior for some change hook
4659 functions.
4660 @end defmac
4661
4662 @defvar first-change-hook
4663 This variable is a normal hook that is run whenever a buffer is changed
4664 that was previously in the unmodified state.
4665 @end defvar
4666
4667 @defvar inhibit-modification-hooks
4668 If this variable is non-@code{nil}, all of the change hooks are
4669 disabled; none of them run. This affects all the hook variables
4670 described above in this section, as well as the hooks attached to
4671 certain special text properties (@pxref{Special Properties}) and overlay
4672 properties (@pxref{Overlay Properties}).
4673
4674 Also, this variable is bound to non-@code{nil} while running those
4675 same hook variables, so that by default modifying the buffer from
4676 a modification hook does not cause other modification hooks to be run.
4677 If you do want modification hooks to be run in a particular piece of
4678 code that is itself run from a modification hook, then rebind locally
4679 @code{inhibit-modification-hooks} to @code{nil}.
4680 @end defvar