Skip Menu |

This queue is for tickets about the HTML-Tree CPAN distribution.

Report information
The Basics
Id: 124734
Status: new
Priority: 0/
Queue: HTML-Tree

People
Owner: Nobody in particular
Requestors: tbruner [...] sandia.gov
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: (no value)



Subject: angle bracket enclosed text
Date: Thu, 8 Mar 2018 23:20:05 +0000
To: "bug-HTML-Tree [...] rt.cpan.org" <bug-HTML-Tree [...] rt.cpan.org>
From: "Bruner, Todd" <tbruner [...] sandia.gov>
Hi, I’m trying to parse the following source HTML containing a smtp message id: $html = <<EOF; <html> <div> <CAEr2S5-HuU1MjnUQtqS6Ri-i3ZaYcTm_+cjf6mkmOgwGJHjPJA@mail.gmail.com> </div> </html> EOF my $tree = HTML::TreeBuilder->new; $tree->implicit_tags(1); $tree->p_strict(1); $tree->no_space_compacting(1); $tree->parse_content($html); $tree->elementify; Later as I walk the tree, when I get to the <div> node, $node->is_empty evaluates to true. Is there a way other than pre-processing the div node to remove the angle brackets to get HTML::Tree to recognize the text in that node? Thanks, Todd