Subject: | Content decoding |
Date: | Wed, 3 Feb 2010 15:49:19 +1000 |
To: | bug-Web-Scraper [...] rt.cpan.org |
From: | Andrew Whatson <whatson [...] gmail.com> |
Hi,
I've noticed that Web::Scraper doesn't handle HTTP::Response objects with a
'content-encoding' of gzip (and presumably others as well). Poking through
the code, it seems to be because an attempt is made at decoding the content
manually instead of using $http_response->decoded_content, and this manual
decoding checks 'content-type' but ignores 'content-encoding'. A patch is
attached that removes all attempts to decode content inside Web::Scraper and
instead trusts the HTTP::Response object to decode its content accurately.
Thanks,
Andrew
Message body is not shown because sender requested not to inline it.