Subject: doesn't catch meta refresh links
The current HTML::LinkExtor doesn't get links found in:
<meta http-equiv="Refresh" content="0; url=/index.html">
I've added a workaround in a subclass I've written:
(in the constructor)
my @meta = ("content");
$HTML::Tagset::linkElements{'meta'} = [ @meta ] ;
You also need code in the links callback to skip other meta tags whose
content attribute holds no link:
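if ( $tag =~ /meta/i ) {
    # Keep only refresh values such as "0;%20url=/index.html"; the pattern
    # expects the space after the semicolon to arrive escaped as %20.
    if ( $value =~ /(.*)\d+;(?:%20)+url=([^\"\'\s)]+)/i ) {
        $value = $1 . $2;    # drop the "0;%20url=" part, keep prefix + target
    }
    else { next; }           # any other meta content value is not a link
}
print $value;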
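
For anyone who wants to try the matching on its own, here is a minimal standalone sketch of the same idea. It is not part of HTML::LinkExtor: the helper name refresh_target is hypothetical, and the pattern is a slightly looser variant of the one above that assumes the space after the semicolon may be literal, escaped as %20, or missing, and that "url" may be in any case.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of HTML::LinkExtor): pull the target out of
# a meta refresh content value such as "0; url=/index.html".
sub refresh_target {
    my ($content) = @_;
    return unless defined $content;

    # delay, semicolon, optional literal or %20-escaped spaces, then url=...
    if ( $content =~ /\d+\s*;(?:\s|%20)*url=["']?([^"'\s)]+)/i ) {
        return $1;
    }
    return;    # no url= part, so this content value is not a link
}

# Sample values are made up for illustration.
for my $content (
    '0; url=/index.html',
    '5;%20URL=http://example.com/',
    'text/html; charset=utf-8',
  )
{
    my $target = refresh_target($content);
    print defined $target
      ? "$content -> $target\n"
      : "$content -> (no link)\n";
}
```

In the callback above, a check like this could stand in for the inline regex: skip the meta tag when nothing is returned, otherwise treat the returned string as the link.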