Skip Menu |

This queue is for tickets about the Parse-MediaWikiDump CPAN distribution.

Report information
The Basics
Id: 16616
Status: resolved
Priority: 0/
Queue: Parse-MediaWikiDump

People
Owner: triddle [...] cpan.org
Requestors: jmrukkers [...] yahoo.com
Cc:
AdminCc:

Bug Information
Severity: Important
Broken in: 0.22
Fixed in: (no value)

Attachments


Subject: non-English Categories not properly detected
When parsing files other than the en file, the Categories are not properly identified and returned. This is caused by the fact that the category selector uses a fixed string, Category: as an anchor to select the category entries. This is not a fixed term; in other languages, the anchor needs to be localized to the relevant language version. Luckily, the anchor can be easily found in the xml file's initial namepace element (under <namespace key="14">Category</namespace>). Regards -- Johannes
[guest - Mon Dec 19 15:50:57 2005]: Show quoted text
> When parsing files other than the en file, the Categories are not > properly identified and returned. This is caused by the fact that > the category selector uses a fixed string, Category: as an anchor > to select the category entries. This is not a fixed term; in other > languages, the anchor needs to be localized to the relevant > language version. Luckily, the anchor can be easily found in the > xml file's initial namepace element (under <namespace > key="14">Category</namespace>). > > Regards -- Johannes
I think the attached version should solve that problem. I did a quick test on an English and German dump file and was able to pull the categories out of both. Thanks for the bug report, and how to solve it, that is a slightly obscure problem; if this version fixes the problem for you please let me know and I'll publish this as the next version of Parse::MediaWikiDump. Tyler Riddle
Download Parse-MediaWikiDump-0.24.tar.gz
application/x-gzip 11.7k

Message body not shown because it is not plain text.

Hello Tyler I tested your solution with a few additional languages, and it worked like a dream, thank you for the quick fix! Regards -- Johannes