Senga Home Page Information retrieval software GNU
Savannah
FSF France

unaccent

NAME
SYNOPSIS
DESCRIPTION
OPTIONS
EXAMPLES
SEE ALSO
AUTHOR

NAME

unaccent - remove accents from input stream or a string

SYNOPSIS

unaccent [--debug_low] [--debug_high] [-h] charset [string] [expected]

DESCRIPTION

With a single argument, unaccent reads data from stdin, replaces accented letters by their unaccented equivalent and writes the result on stdout. If the second argument ('string') is provided unaccent transforms it by replacing accented letters by their unaccented equivalent. The result is printed on the standard output. The charset of the input string or the data read from stdin is specified by the 'charset' argument (ISO-8859-15 for instance). The output is printed using the same charset.
If the 'expected' argument is provided, the output string is compared to it. If they are not equal unaccent exits on error.
unaccent relies on the iconv(3) library to convert from the specified charset to UTF-16BE (or UTF-16 if UTF-16BE is not available). You should check the manual pages for available charsets. On GNU/Linux the command
iconv -l
shows all available charsets.

OPTIONS

--debug_low
Prints human readable information about the unaccentuation process. See unac(3) for more information.
--debug_high
Prints very detailed information about the unaccentuation process. See unac(3) for more information.
--help -h
Prints a short usage and exits.

EXAMPLES

Remove accents from the string été and check that the result is ete.
unaccent ISO-8859-1 été ete
Remove accents from file myfile and put the result in file myfile.unaccent
unaccent ISO-8859-1 < myfile > myfile.unaccent

SEE ALSO

unac(3), iconv(3)

AUTHOR

Loic Dachary loic@senga.org
http://www.senga.org/unac/

 
unac
Home
Description
Documentation
Download
Debian
RPM
Perl
PHP
License
CVS
Mailing lists
Tasks
Contribute
Freshmeat
Savannah
Projects
Catalog
GNU Mifluz
unac
uri
webbase
Senga
Home
Old News
Credits
Team
Ducks
XHTML Source  |   XSL Style Sheet  
 webmaster@senga.org
Copyright (C) 2002 Loic Dachary, 12 bd Magenta, 75010 Paris, France
Verbatim copying and distribution of this entire article is permitted in any medium, provided this notice is preserved.