There is no support for categories, as well the Wikipedia category graph is a nightmare,
which is specifically why I removed support from MediaWiki::DumpFile. This has not been
solved by anyone afaik.
Please also understand that the ticket system is for bugs related to my software, not
developer support.
Cheers,
Tyler
On Wed Jun 09 12:17:01 2010, syed.yasin@gmail.com wrote:
Show quoted text> Hi Tyler, Greetings!
>
> What are the changes to identify "Categories" in the current version.
> I am
> using "Parse::MediaWikiDump" along with modified wikiprep as told.
>
> My requirement is to only process "Medical/Health" Related articles.
>
> Warm Regards,
> Syed Yasin
>
> On Sun, Jun 6, 2010 at 4:33 PM, Syed Yasin <syed.yasin@gmail.com>
> wrote:
>
> > Hi Tyler, Greetings!
> >
> > Your inputs were of immense help, I am glad the issue is resolved -
> Thanks
> > again!
> >
> > Warm Regards,
> > Syed Yasin
> >
> > On Sun, Jun 6, 2010 at 5:52 AM, Syed Yasin <syed.yasin@gmail.com>
> wrote:
> >
> >> Thankyou very much again. I will try to first take a short cut by
> changing
> >> wikiprep accordingly. If this works out I should be very glad else
> will have
> >> to find an alternate solution.
> >>
> >> Warm Regards,
> >> Syed Yasin
> >> Sent from my iPhone, pls ignore typo's
> >>
> >>
> >> On Jun 6, 2010, at 5:11 AM, "Tyler Riddle via RT" <
> >> bug-Parse-MediaWikiDump@rt.cpan.org> wrote:
> >>
> >> <URL:
https://rt.cpan.org/Ticket/Display.html?id=58169 >
> >>>
> >>> Parse::MediaWikiDump and MediaWiki::DumpFile both have either
> limited or
> >>> no
> >>> understanding of Mediawiki internals by design; in order to
> generate data
> >>> such as Wikiprep
> >>> you would have to either just stick with Wikiprep and change all
> >>> instances of page to next or
> >>> recreate Wikiprep which is probably a waste of time.
> >>>
> >>> That being said, MediaWiki::DumpFile::Pages has the cleanest API
> and
> >>> performs the fastest
> >>> and should be used for all new projects.
> >>>
> >>> Cheers,
> >>>
> >>> Tyler
> >>>
> >>> On Sat Jun 05 19:36:54 2010, syed.yasin@gmail.com wrote:
> >>>
> >>>> Hi thanks very much for the speedy reply, can you please
> highlight more
> >>>> information about the exact/optimum way to parse wiki as of date.
> My
> >>>> requirement is something similar to what wikiprep was designed to
> >>>> extract.
> >>>>
> >>>> Warm Regards,
> >>>> Syed Yasin
> >>>>
> >>>> On Sun, Jun 6, 2010 at 5:00 AM, Tyler Riddle via RT <
> >>>> bug-Parse-MediaWikiDump@rt.cpan.org> wrote:
> >>>>
> >>>> <URL:
https://rt.cpan.org/Ticket/Display.html?id=58169 >
> >>>>>
> >>>>> The "page" method was depreciated a very long time ago and
> finally
> >>>>> removed
> >>>>> - it was replaced
> >>>>> with the "next" method which functions identically. It seems
> wikiprep
> >>>>> is
> >>>>> out of date, this is not
> >>>>> an issue with version 0.4 dump files or Parse::MediaWikiDump.
> >>>>>
> >>>>> Additionally Parse::MediaWikiDump itself has been depreciated
> for
> >>>>> MediaWiki::DumpFile which
> >>>>> offers a backwards compatible interface with twice the
> throughput of
> >>>>> Parse::MediaWikiDump::Pages.
> >>>>>
> >>>>> Cheers,
> >>>>>
> >>>>> Tyler
> >>>>>
> >>>>>
> >>>>
> >>>>
> >>>>
> >>>
> >>>
> >>>
> >
> >
> > --
> > Regards,
> > Syed Yasin
> >
>
>
>