Subject: | UTF-8 not iso8859-1 |
Hi,
Many thanks for your HTML::WikiConverter, very impressive.
Unfortunately, I haven't found anything about coding and UTF-8.
When using your Web form and calling an external URL (HTML in UTF-8) to be converted in wiki, I do not really get my "e-acute"s in the Wiki output area but the double 8 bits characters corresponding to e-acute in UTF-8. That's not fully operational but not wrong.
Now when I use your html2wiki perl script on the same raw html file, I get "démographiques" for "démographiques", i.e the translation (from probably iso-8859-1) of both 8 bits characters into Atilde and copy.
I tried to look at the source code but I don't know where the transformation is done. I mean during which phases.
Also, where can I specify that the source is in UTF-8 and not iso-8859-1 . Any environmemt? or meta in the html source?
Many thanks,
Nicolas