Subject: | more internationalization |
Hi Iain,
I have a couple more things to add as far as internationalization goes...
in the get_extent sub:
i have seen "of", "/", "von" (german), "de" (spanish), and "di" (italian) on /messages/1 pages, so i made the following change:
< (\d+)-\d+ \s+ (?:of|/) \s+
---
Show quoted text
> (\d+)-\d+ \s+ (?:of|/|de|von|di) \s+
you may be able to get away with replacing all those with a "\S+" but that may be too vague a regex... i havent tested it yet.
in the fetch_message sub:
when checking whether its hit one of those annoying ads, instead of searching for advertising specific text, we could search the current uri to see if it includes /interrupt?st... as all ads seem to.
< if ($content =~ /\QYahoo! Groups is an advertising supported service.\E/gsm)
---
Show quoted text> my $curi = $w->uri();
> if ($curi =~ /\/interrupt\?st/gsm)
and instead of follow(), we could use the "follow_link()" method from WWW::Mechanize which includes a way to search urls...
< $w->follow('Continue to message');
---
Show quoted text> $w->follow_link( url_regex => qr/\/message\//i );
the last two changes should work for every language. ive tested them with english, german, italian, portugese (brazil), and spanish.
cheers,
ray cielencki
pause: slinky