On Mon, Jan 21, 2008 at 03:37:55PM -0500, Geoffrey Leach via RT wrote:
> Mon Jan 21 15:37:49 2008: Request 32489 was acted upon.
> Transaction: Ticket created by GLEACH
> Queue: Regexp-Common
> Subject: %xx not honored
> Broken in: 2.120
> Severity: Important
> Owner: Nobody
> Requestors: GLEACH@cpan.org
> Status: new
> Ticket <URL: http://rt.cpan.org/Ticket/Display.html?id=32489 >
>
>
> use Regexp::Common qw /URI/;
> $url = "http://spamassassin.apache%2Eorg/";
> @parsed = $url =~ /$RE{URI}{HTTP}{-keep}/;
> print join("\n", @parsed), "\n";
>
> result:
> http://spamassassin.apache
> http
> spamassassin.apache
>
> %2E maps to '.'
%2E maps to '.', but only in the path part of an HTTP URL. There is
*no* escaping in the hostport part of the URL.
Not decoding %2E in the host part is therefore correct.
See also the grammar in section 5 of RFC 1738.
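
To make the grammar argument concrete, here is a minimal sketch that transcribes the relevant RFC 1738 section 5 rules (hostport, hpath, escape) into plain Perl regexes. It is a hand-written illustration, not the pattern Regexp::Common uses internally; the names parse_http_url and unescape are hypothetical helpers introduced only for this example.

```perl
use strict;
use warnings;

# Sketch of the RFC 1738 section 5 rules for HTTP URLs (assumed transcription,
# not the Regexp::Common implementation).
my $escape      = qr/%[0-9A-Fa-f]{2}/;                       # "%" hex hex
my $unreserved  = qr/[A-Za-z0-9\$\-_.+!*'(),]/;              # alpha | digit | safe | extra
my $uchar       = qr/(?:$unreserved|$escape)/;
my $domainlabel = qr/[A-Za-z0-9](?:[A-Za-z0-9-]*[A-Za-z0-9])?/;
my $toplabel    = qr/[A-Za-z](?:[A-Za-z0-9-]*[A-Za-z0-9])?/;
my $hostname    = qr/(?:$domainlabel\.)*$toplabel/;          # no escape allowed here
my $hostnumber  = qr/\d+\.\d+\.\d+\.\d+/;
my $hostport    = qr/(?:$hostname|$hostnumber)(?::\d+)?/;
my $hsegment    = qr/(?:$uchar|[;:\@&=])*/;                  # escapes allowed here
my $hpath       = qr/$hsegment(?:\/$hsegment)*/;
my $search      = $hsegment;

# Hypothetical helper: match the longest valid HTTP URL prefix of $url.
sub parse_http_url {
    my ($url) = @_;
    return unless $url =~ m{^(http://($hostport)(?:/($hpath)(?:\?$search)?)?)};
    return { matched => $1, hostport => $2, path => $3 };
}

# Hypothetical helper: decode %XX escapes, as is legal in the path part.
sub unescape {
    my ($s) = @_;
    $s =~ s/%([0-9A-Fa-f]{2})/chr hex $1/ge;
    return $s;
}

for my $url ('http://spamassassin.apache%2Eorg/',
             'http://spamassassin.apache.org/a%2Eb') {
    my $r = parse_http_url($url);
    printf "%s\n  matched:  %s\n  hostport: %s\n  path:     %s\n",
        $url, $r->{matched}, $r->{hostport},
        defined $r->{path} ? unescape($r->{path}) : '(none)';
}
```

Under these rules the first URL matches only up to "http://spamassassin.apache", because "%" cannot appear in hostport, which is the behaviour reported in the ticket. In the second URL the escape sits in the path, so it is accepted and decodes to "a.b".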
Regards,
Abigail