This queue is for tickets about the XML-SAX CPAN distribution.

Report information
The Basics
Id: 35006
Status: rejected
Priority: 0/
Queue: XML-SAX

People
Owner: Nobody in particular
Requestors: MLAWREN [...] cpan.org
Cc:
AdminCc:

Bug Information
Severity: Wishlist
Broken in: (no value)
Fixed in: (no value)



Subject: Provide hooks/access to the underlying XML::LibXML object parameters
I've been fighting with problems like "entity 'nbsp' not defined" when parsing HTML documents. Maybe I just haven't found the knobs to allow extra entities (any hints would be appreciated), but it would be nice anyway to have access to the underlying XML::LibXML object. Then I could call things like $parser->expand_entities(0). Cheers, Mark. -- Mark Lawrence
XML::SAX does not have an "underlying XML::LibXML object". Also, if your document uses &nbsp; but does not declare the meaning of that entity in a DTD, then your document is not well-formed XML and an XML parser module *must* reject it. If you want to parse HTML, then I recommend the XML::LibXML module. It has a parse_html method which will cope with standard HTML entities, and the resulting DOM's XPath support makes it easier to use than the SAX API.
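
For anyone landing on this ticket with the same error, here is a minimal sketch of the approach recommended above. It assumes XML::LibXML is installed and uses parse_html_string, the string-input form of the HTML parsing methods; the sample markup and variable names are made up for illustration.

```perl
use strict;
use warnings;
use XML::LibXML;

# Hypothetical sample input: uses &nbsp; with no DTD declaring it.
my $markup = '<html><body><p>Fish&nbsp;&amp;&nbsp;chips</p></body></html>';

my $parser = XML::LibXML->new;

# 1. Parsed as XML, the undeclared entity is a well-formedness error,
#    so parse_string dies and $@ holds the parser's message.
my $xml_error;
eval { $parser->parse_string($markup); 1 } or $xml_error = $@;
print "XML parse failed as expected\n" if $xml_error;

# 2. Parsed as HTML, the standard HTML entities are known to the parser.
my $doc = $parser->parse_html_string($markup);

# XPath works on the resulting DOM; findvalue returns the string value
# of the first matching node.
my $text = $doc->findvalue('//p');
print "paragraph contains a no-break space\n" if $text =~ /\x{A0}/;
```

For real-world HTML that is not cleanly structured, the parser's recover option may also be needed; that is left out here to keep the sketch short.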