
This queue is for tickets about the XML-RSS-FromHTML-Simple CPAN distribution.

Report information
The Basics
Id: 47646
Status: resolved
Priority: 0/
Queue: XML-RSS-FromHTML-Simple

People
Owner: Nobody in particular
Requestors: bem [...] residents.frmug.org
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: (no value)



Subject: utf-8 - iso 8859-1
Date: Tue, 7 Jul 2009 15:54:02 +0200
To: bug-XML-RSS-FromHTML-Simple [...] rt.cpan.org
From: bem <bem [...] residents.frmug.org>
package XML::RSS::FromHTML::Simple $VERSION = "0.04"
perl 5.8.8

I have an encoding problem dealing with a title in utf-8 and iso 8859-1 html source. My title is encoded as the html is iso 8859-1.

code extract:

    my $proc = XML::RSS::FromHTML::Simple->new({
        title    => "DDM - AFP média",
        url      => "http://www.ddm.gouv.fr/afp/francais/topics/internet/index.php",
        base_url => "http://www.ddm.gouv.fr/afp/francais/topics/internet/",
        rss_file => "/var/www/rss/DDM_AFP.rss",
    });

Thanks.
> My title is encoded as the html is iso 8859-1.
The RSS feed expects utf8, not iso-8859-1. If you have the title in iso-8859-1, use

    use Encode qw(from_to);
    from_to($title, "iso-8859-1", "utf-8");

to convert it first.
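
The conversion suggested above can be sketched as a standalone script. This is a minimal sketch, assuming the title arrives as raw iso-8859-1 bytes; the byte-escaped literal below is an illustration of that assumption, not code from the reporter's script.

```perl
use strict;
use warnings;
use Encode qw(from_to);

# Assumption: the title is a byte string in iso-8859-1,
# so "é" is the single byte 0xE9.
my $title = "DDM - AFP m\xE9dia";

# from_to() converts the byte string in place. It returns the length
# of the converted string in octets, or undef on error.
my $length = from_to($title, "iso-8859-1", "utf-8");
die "conversion from iso-8859-1 failed\n" unless defined $length;

# $title now holds UTF-8 bytes: "é" has become the two bytes 0xC3 0xA9.
print "converted title is $length bytes long\n";
```

After the conversion, $title can be passed as the title argument in the constructor call from the report. Note that from_to() modifies its first argument, so it needs a variable rather than a string literal.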