Subject: Correction to the documentation, under "Caveats"
In the documentation under "Caveats", the phrase "make sure that the input data really is a utf8 string" appears to be incorrect. utf8 is a variable-length encoding, whereas Text::Unidecode wants a fixed-length (two-byte) encoding for each character.
To fix this, you could phrase it as "make sure that the input data really is a string of two-byte Unicode characters". This is also referred to as UCS-2, in case you want to include that moniker.
How about also providing a tip on how to convert strings that really are utf8? You would do it like so:
use Text::Unidecode;
my $decode_status = utf8::decode($input_to_be_converted);  # decodes in place; false if the input was not valid utf8
my $converted_string = unidecode($input_to_be_converted);
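
For illustration, here is a slightly fuller sketch of the same tip. The sample string and the variable names are my own assumptions, not something taken from the documentation. It shows that utf8::decode changes the string in place from bytes to characters, and it only calls unidecode if Text::Unidecode happens to be installed.

```perl
use strict;
use warnings;

# Hypothetical input: the raw utf8 bytes for "café", as they might
# arrive from a file or socket that was read without a decoding layer.
my $input = "caf\xC3\xA9";
print length($input), "\n";    # 5: the string is still counted in bytes

# utf8::decode works in place and returns false if the bytes
# are not valid utf8, so check the status before going on.
my $decode_status = utf8::decode($input);
die "input was not valid utf8\n" unless $decode_status;
print length($input), "\n";    # 4: the string is now counted in characters

# Text::Unidecode is a CPAN module, so load it only if it is available.
if (eval { require Text::Unidecode; 1 }) {
    print Text::Unidecode::unidecode($input), "\n";
}
```

Checking the return value matters because a failed decode leaves the string as it was, and unidecode would then be applied to the raw bytes.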