[guest - Sun Dec 18 16:08:02 2005]:
Show quoted text> Hi, I found what is probably a bug in parsing through the german
> wikipedia xml dump - the MediaWikiDump example program dies on an
> empty username in the dump file, this could probably just be
> ignored and processing could continue.
Thanks for the bug report. This is an interesting case... It looks like the username in this case is
the unicode value for a space which is causing the underlying XML parser to miss the value in
the username field. I'm not sure what the proper method to resolve this is. I'll have to contact
the MediaWiki developers to verify this is not a bug in MediaWiki first because the only solution I
can think of to the problem can lead to behavior which will not be consistent with the
underlying MediaWiki data in the XML file.
Tyler Riddle