Subject: | New Error Message because of UTF8 Encoded Data |
Date: | Thu, 5 Jun 2008 10:11:58 +0200 |
To: | <bug-Finance-Quote [...] rt.cpan.org> |
From: | "Bartschies, Thomas" <Thomas.Bartschies [...] cvk.de> |
Hello,
since today we're getting the following error message when using Finance::Quote:
Parsing of undecoded UTF-8 will give garbage when decoding entities at /usr/lib/perl5/vendor_perl/5.8.6/Finance/Quote.pm line 242.
We're trying to get currency data. No Stock data. Because we using the round robin mode of the
module, we're not sure from which Site we're actually getting the data. Most likely from a yahoo Site though.
We've already found out that the message comes from the HTML::Parser Module, that is used by
HTML::TableExtract. You can in fact switch the Parser Module to utf8 Mode. But because the HTML::Extract
Module doesn't give the Parser handle, we'd have to activate it in HTML::Extract directly.
We're reluctant to change these standard Modules, because there might be a way to convert the
data before giving them to the HTML::Extract module. This would be the an elegant way to solve
the problem.
Could you implement this, or solve the problem in another way?
Best regards,
--
i. A. Thomas Bartschies
IT Systeme
Cornelsen Verlagskontor GmbH & Co. KG
Kammerratsheide 66, 33609 Bielefeld
Telefon 0521.9719-310
Telefax 0521.9719-93310
http://www.cvk.de
AG Bielefeld HRA 10578 - Geschäftsführer: Horst Keplinger
Geschäftsführende Komplementärin: AG Bielefeld HRB 7107 - Cornelsen Verlagskontor Verwaltungs-GmbH
Weitere Komplementärin: AG Charlottenburg HRA 20764 - Cornelsen Verlagsholding GmbH & Co., Berlin