Subject: | Mishandled ASCII in IMDB::Film |
Date: | Thu, 27 Aug 2009 09:57:26 +0200 |
To: | <bug-imdb-film [...] rt.cpan.org> |
From: | paul ginzberg <paulginzberg [...] hotmail.com> |
Hello,
Special ASCII characters such as accents and most punctuation aren't displayed correctly. When extracting information from the IMDB HTML, relevant fields are copied verbatim, and special characters appear as, say '' where 12 is the hexadecimal ascii code of the character.
I suggest using the following regular expression on all fields to fix this: s/\&\#x(\w\w)\;/chr(hex $1)/eg
I believe that this bug will be reproduced on all systems, but here are my details:
Perl v5.10.0
MS Vista Home Premium SP1
I hope that I have been of help
Regards
Show quoted text
_________________________________________________________________
Share your memories online with anyone you want.
http://www.microsoft.com/middleeast/windows/windowslive/products/photos-share.aspx?tab=1