Bug #68258 for Web-Scraper: Failed to get text of script tag

Tue May 17 01:49:24 2011 hayzer [...] gmail.com - Ticket created

Subject:

Failed to get text of script tag

Hi, I'm trying to rip off a plain JS code from the attached file. This is my code: {{{ my $scraper = scraper { process "script", "scripts[]" => "TEXT"; }; my $jscode = $scraper->scrape($htmlfile); }}} The result is: {{{ $VAR1 = { 'scripts' => [ '', '', '' ] }; }}} Sorry if it is my mistake and not a bug. Web::Scraper => 0.34 HTML::Element => 4.2 HTML::Selector::XPath => 0.07 HTML::Entities => 3.68 HTML::Tagset => 3.20 HTML::TreeBuilder::XPath => 0.12 perl -v => This is perl 5, version 12, subversion 3 (v5.12.3) built for i386-linux-thread-multi uname -a => Linux 2.6.35.12-90.fc14.i686 #1 SMP Fri Apr 22 16:14:44 UTC 2011 i686 i686 i386 GNU/Linux

Subject:

index4.html

Wed Aug 22 10:31:26 2012 https://www.google.com/accounts/o8/id?id=AItOawmEhI-Jw0rC7lxG8TI14vhh70VbpSJDDek - Correspondence added

You should be able to use 'RAW' instead of 'TEXT' to get the JS. On Tue May 17 01:49:24 2011, hayzer@gmail.com wrote: Show quoted text

> Hi, > > I'm trying to rip off a plain JS code from the attached file. > This is my code: > {{{ > my $scraper = scraper { > process "script", "scripts[]" => "TEXT"; > }; > > my $jscode = $scraper->scrape($htmlfile); > }}} > > The result is: > {{{ > $VAR1 = { > 'scripts' => [ > '', > '', > '' > ] > }; > }}} > > Sorry if it is my mistake and not a bug. > > Web::Scraper => 0.34 > HTML::Element => 4.2 > HTML::Selector::XPath => 0.07 > HTML::Entities => 3.68 > HTML::Tagset => 3.20 > HTML::TreeBuilder::XPath => 0.12 > > perl -v => > This is perl 5, version 12, subversion 3 (v5.12.3) built for > i386-linux-thread-multi > uname -a => > Linux 2.6.35.12-90.fc14.i686 #1 SMP Fri Apr 22 16:14:44 UTC 2011 i686 > i686 i386 GNU/Linux

Wed Aug 22 10:31:28 2012 The RT System itself - Status changed from 'new' to 'open'

Tue Aug 28 19:56:21 2012 MIYAGAWA [...] cpan.org - Status changed from 'open' to 'rejected'

Bug #68258 for Web-Scraper: Failed to get text of script tag

Preferred bug tracker