Skip Menu |

This queue is for tickets about the LWP-Charset CPAN distribution.

Report information
The Basics
Id: 16838
Status: resolved
Priority: 0/
Queue: LWP-Charset

People
Owner: Nobody in particular
Requestors: njh [...] bandsman.co.uk
Cc:
AdminCc:

Bug Information
Severity: Important
Broken in: 0.05
Fixed in: (no value)



Subject: Can't find the character set in some sites
Try running LWP::Charset on http://home4.highway.ne.jp/akdaruma/vivid-home.html It fails to find the character set in <META HTTP-EQUIV="Content-Type" CONTENT="text/html;CHARSET=x-sjis"> I don't know if it's the lack of space after the semi-colon or the incorrect quotation marks around the content field (why don't webmasters read RFCs??).
It seems to be a common problem. Here's another example from http://www.hcyb.com/ which LWP::Charset fails to parse: <META http-equiv="Content-Type" content="text/html; charset=iso-8859-1">