Subject: German stemmer does not terminate for specific word form
Date: Mon, 23 Aug 2010 13:30:37 +0200
To: bug-Lingua-Stem [...] rt.cpan.org
From: Jeremy Jancsary <jeremy.jancsary [...] ofai.at>

Hi,
I recently noticed that the German ('DE') variant of Lingua::Stem does not
seem to terminate for a specific input, the word form 'anstrengendstem'. The
problem also occurs for similar word forms such as 'aufregendstem', so I
assume that the suffix plays a role.
Here is a minimal example that illustrates the problem:
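
    use Lingua::Stem;

    # Select the German stemmer via the '-locale' option
    my $stemmer = Lingua::Stem->new({ -locale => 'DE' });

    # This call never returns for the word form below
    $stemmer->stem("anstrengendstem");

    print STDERR "I'm never reached!\n";
    exit 0;
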
I'm using Lingua-Stem-0.84 on perl, v5.10.1 (*) built for
x86_64-linux-gnu-thread-multi.
I consider this a rather critical bug: it is hard for a caller to protect
against non-termination, and the consequences are severe (especially in
server applications, where a single such call blocks the process indefinitely).
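
Until the stemmer itself is fixed, one possible stopgap on the caller's side is to wrap the call in a timeout. The following is only a minimal sketch: `with_timeout` is a hypothetical helper, not part of Lingua::Stem, and it assumes that the loop runs in ordinary Perl code. Since Perl delivers signals between opcodes, a loop stuck inside a single regex match or inside XS code might not be interrupted this way, in which case running the stemmer in a child process would be the safer option.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Lingua::Stem): run $code, but give up
# after $seconds. Returns whatever $code returns, or undef on timeout.
sub with_timeout {
    my ($seconds, $code) = @_;
    my $result;
    my $ok = eval {
        # The trailing newline keeps Perl from appending "at ... line N"
        local $SIG{ALRM} = sub { die "timeout\n" };
        alarm $seconds;
        $result = $code->();
        alarm 0;
        1;
    };
    my $err = $@;
    alarm 0;    # make sure no alarm is left pending after an error
    return $result if $ok;
    die $err unless $err eq "timeout\n";    # re-throw unrelated errors
    return;
}
```

With the `$stemmer` object from the example above, a call such as `with_timeout(5, sub { $stemmer->stem("anstrengendstem") })` would then be expected to return undef after five seconds instead of hanging, provided the assumption about interruptibility holds.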
Cheers,
Jeremy