Skip Menu |

This queue is for tickets about the HTML-DOM CPAN distribution.

Report information
The Basics
Id: 116941
Status: open
Priority: 0/
Queue: HTML-DOM

People
Owner: Nobody in particular
Requestors: 'spro^^*%*^6ut# [...] &$%*c
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: (no value)



Subject: Unknown tags are dropped
https://rt.cpan.org/Ticket/Display.html?id=116940 applies to HTML::DOM, too. The shortest code that will reproduce it involves WWW::Scripter: $ perl -mWWW::Scripter -e '$w = new WWW::Scripter; $w->get("data:text/html, <abc></abc>"); warn $w->content' <html><head></head><body></body> </html> at -e line 1.
Subject: [rt.cpan.org #116941] Unknown tags are dropped
Date: Mon, 24 Feb 2020 09:07:23 -0800
To: bug-HTML-DOM [...] rt.cpan.org
From: Wayne Davison <wayne [...] opencoder.net>
I've been manually editing the installed HTML/DOM.pm file to tell the parser to stop ignoring unknown tags. My change looks like this (I added one line to the "open" sub): sub open { (my $self = shift)->detach_content; # [... elided comment ...] $self->push_content( my $tb = $$self{_HTML_DOM_parser} = new HTML::DOM::Element::HTML ); $tb->ignore_unknown(0); # NOTE: Added! That makes allowing unknown tags the default, and has worked very well for me. ..wayne..