Subject: | Invalid parsing of META refresh tags |
The split function that is parsing the content attribute in a meta refresh tag is casesensitive. When using the URL tag in lowercase it doesn't return anything.
--- LinkExtractor.pm.org 2005-01-06 09:45:02.000000000 +0100
+++ LinkExtractor.pm 2005-01-06 09:33:58.000000000 +0100
@@ -163,7 +163,7 @@
defined $$NL{'name'} && $$NL{'name'} =~ /refresh/i
) ) {
- my( $timeout, $url ) = split m{;\s*?URL=}, $$NL{content},2;
+ my( $timeout, $url ) = split m{;\s*?URL=}i, $$NL{content},2;
my $base = $self->{_base};
$$NL{url} = URI->new_abs( $url, $base ) if $base;
$$NL{url} = $url unless exists $$NL{url};