Skip Menu |

This queue is for tickets about the Lingua-Stem-Snowball CPAN distribution.

Report information
The Basics
Id: 81373
Status: rejected
Priority: 0/
Queue: Lingua-Stem-Snowball

People
Owner: Nobody in particular
Requestors: lvaliukas [...] cyber.law.harvard.edu
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: (no value)



Subject: Lingua::Stem::Snowball 0.96 + Lithuanian language support (patch attached)
Date: Fri, 23 Nov 2012 13:28:31 +0200
To: bug-Lingua-Stem-Snowball [...] rt.cpan.org, Marvin Humphrey <marvin [...] rectangular.com>
From: Linas Valiukas <lvaliukas [...] cyber.law.harvard.edu>
Hi Marvin, I have added Lithuanian language support to Lingua::Stem::Snowball and bumped the version number to 0.96. Please see https://github.com/pypt/Lingua-Stem-Snowball-plus-Lithuanian/commit/bf7bbd0f86f325af6eccf4f39a44b1674ee0b4fe for the diff. I have also attached the diff to this email. It would be great if you could find some time to push the updated module with the Lithuanian language support to CPAN. Please let me know if you need any other changes from me and / or you have questions or comments. Regards, -- Linas Valiukas Media Cloud project (www.mediacloud.org)

Message body is not shown because sender requested not to inline it.

Hi, Adding a Lithuanian stemmer is a nice idea. However, where does this code come from? All the stemmers in the Lingua::Stem::Snowball bundle are from the Snowball project directly (at snowball.tartarus.org). Why not submit the project upstream? (Is it because of the Academic Free License?) That would make things easier for us. I would prefer to take on only stemmers which have been deemed acceptable by Porter and Boulton, as I don't have the expertise to assess stemmer quality myself. If you would like to make your own CPAN release (Lingua::Stem::Lt?), you are free to reuse the code that is in Lingua::Stem::Snowball under the terms of the "Perl" license (GPL/Artistic). (Unfortunately this differs from the BSD license of the Snowball library itself, but it was inherited code so not my choice.) Best, Marvin Humphrey
Closing, since the Lithuanian stemmer was released on CPAN in a separate distro. Congrats!