Subject: angle bracket enclosed text
Date: Thu, 8 Mar 2018 23:20:05 +0000
To: "bug-HTML-Tree [...] rt.cpan.org" <bug-HTML-Tree [...] rt.cpan.org>
From: "Bruner, Todd" <tbruner [...] sandia.gov>
Hi,
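I’m trying to parse the following source HTML, which contains an SMTP message ID:
use HTML::TreeBuilder;

# Single-quoted heredoc so that Perl does not try to interpolate @mail
my $html = <<'EOF';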
<html>
<div>
<CAEr2S5-HuU1MjnUQtqS6Ri-i3ZaYcTm_+cjf6mkmOgwGJHjPJA@mail.gmail.com>
</div>
</html>
EOF
my $tree = HTML::TreeBuilder->new;
$tree->implicit_tags(1);
$tree->p_strict(1);
$tree->no_space_compacting(1);
$tree->parse_content($html);
$tree->elementify;
Later, as I walk the tree and reach the <div> node,
$node->is_empty evaluates to true.
Is there a way to get HTML::Tree to recognize the text in that node, other than pre-processing the div's content to remove the angle brackets?
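For context, the kind of pre-processing I would like to avoid looks roughly like the sketch below. The helper name escape_message_ids and the regex are illustrative assumptions, not part of HTML::Tree. It escapes the brackets rather than removing them, and it only handles bracketed tokens that contain an "@" and no whitespace:

```perl
use strict;
use warnings;

# Hypothetical helper (not part of HTML::Tree): escape the angle brackets
# around anything that looks like <local@domain> so the parser treats it
# as text instead of an unknown tag.
sub escape_message_ids {
    my ($html) = @_;
    # Match "<", a run without brackets/whitespace/@, an "@", another such
    # run, then ">". Real tags with attributes contain whitespace and are
    # therefore left alone.
    $html =~ s/<([^<>\s\@]+\@[^<>\s\@]+)>/&lt;$1&gt;/g;
    return $html;
}
```

The result of escape_message_ids($html) would then be passed to $tree->parse_content in place of the raw $html.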
Thanks,
Todd