aboutsummaryrefslogtreecommitdiffstats
path: root/doc/gawk.texi
diff options
context:
space:
mode:
authorArnold D. Robbins <arnold@skeeve.com>2014-07-04 13:07:13 -0400
committerArnold D. Robbins <arnold@skeeve.com>2014-07-04 13:07:13 -0400
commitf6e8ca91a0046536ed5ffef60bb818d674fadf54 (patch)
treeca325e199a67a891dc9a65b5ca25ee36dad6f44d /doc/gawk.texi
parentaa06b223e89872515f5abe08566c32cae2af3024 (diff)
downloadegawk-f6e8ca91a0046536ed5ffef60bb818d674fadf54.tar.gz
egawk-f6e8ca91a0046536ed5ffef60bb818d674fadf54.tar.bz2
egawk-f6e8ca91a0046536ed5ffef60bb818d674fadf54.zip
Document how to match ASCII characters.
Diffstat (limited to 'doc/gawk.texi')
-rw-r--r--doc/gawk.texi11
1 files changed, 11 insertions, 0 deletions
diff --git a/doc/gawk.texi b/doc/gawk.texi
index 08fca873..c6520ba3 100644
--- a/doc/gawk.texi
+++ b/doc/gawk.texi
@@ -5457,6 +5457,17 @@ With the POSIX character classes, you can write
@code{/[[:alnum:]]/} to match the alphabetic
and numeric characters in your character set.
+@c Thanks to
+@c Date: Tue, 01 Jul 2014 07:39:51 +0200
+@c From: Hermann Peifer <peifer@gmx.eu>
+Some utilities that match regular expressions provide a non-standard
+@code{[:ascii:]} character class; @command{awk} does not. However, you
+can simulate such a construct using @code{[\x00-\x7F]}. This matches
+all values numerically between zero and 127, which is the defined
+range of the ASCII character set. Use a complemented character list
+(@code{[^\x00-\x7F]}) to match any single-byte characters that are not
+in the ASCII range.
+
@cindex bracket expressions, collating elements
@cindex bracket expressions, non-ASCII
@cindex collating elements