Bug #26654 for Pod-POM-Web: [WISH] [PATCH] Pod abstract as tooltip

Tue Apr 24 22:25:07 2007 chris+rt [...] chrisdolan.net - Ticket created

Subject:

[WISH] [PATCH] Pod abstract as tooltip

[Note: the above says my patch is against 1.02, but it's really against 1.03. RT just hasn't picked up 1.03 from CPAN yet] I would like to be able to see the one-line summary of a module in the TOC listing, perhaps as a tooltip via the HTML title attribute. Attached is a not-so-good patch that implements this, along with some caching of both the HTML and the POM representations of the POD. Also attached is a screenshot showing the tooltip. I say the patch is not-so-good primarily because the TOC load time is significantly slower for the first hit because I'm pre-parsing all of the files just to get that first =head1. A better approach might be to do a quick-and-dirty parse for the =head1 NAME to get the abstract if the POM is not already cached.

Subject:

tooltip.png

Download tooltip.png
image/png 115k

Subject:

patch4

Download patch4
application/octet-stream 5.6k

Message body not shown because it is not plain text.

Tue Apr 24 23:03:09 2007 DAMI [...] cpan.org - Correspondence added

From:

DAMI [...] cpan.org

Le Mar. Avr. 24 22:25:07 2007, CDOLAN a écrit : Show quoted text

> [Note: the above says my patch is against 1.02, but it's really

against Show quoted text

> 1.03. RT just hasn't picked up 1.03 from CPAN yet] > > I would like to be able to see the one-line summary of a module in

the Show quoted text

> TOC listing, perhaps as a tooltip via the HTML title attribute. > Attached is a not-so-good patch that implements this, along with some > caching of both the HTML and the POM representations of the POD.

Also Show quoted text

> attached is a screenshot showing the tooltip. > > I say the patch is not-so-good primarily because the TOC load time is > significantly slower for the first hit because I'm pre-parsing all of > the files just to get that first =head1. A better approach might be

to Show quoted text

> do a quick-and-dirty parse for the =head1 NAME to get the abstract if > the POM is not already cached. >

Thanks for the suggestion, I'll think of that. For that application I'm not so found of caching, because it's mostly a single-user app without much load, and no strong requirements on response time, and running on a workstation, not a server : so no good reason for eating up more memory. Moreover caching already happens within the browser. As for tooltips, that could be done for installations that use the indexer, because the indexing operation has to parse all modules anyway, so at the same time it could store the module descriptions somewhere. On the other hand, the module description is just one click away ...

Tue Apr 24 23:03:11 2007 The RT System itself - Status changed from 'new' to 'open'

Thu Apr 26 02:35:19 2007 chris+rt [...] chrisdolan.net - Correspondence added

On Tue Apr 24 23:03:09 2007, DAMI wrote: Show quoted text

> As for tooltips, that could be done for installations that use the > indexer, because the indexing operation has to parse all modules > anyway, so at the same time it could store the module descriptions > somewhere. On the other hand, the module description is just one click > away ...

That makes a lot of sense. I implemented your idea in the attached patch. The altered code now stores the POD description in the docs database as a third column. When building the TOC, that DB record is retrieved and the abstract (if present) is set as the title attribute. The slowdown is negligible, both for TOC generation and indexing.

diff -ur /Users/chris/perl/lib/perl5/site_perl/Pod/POM/Web/Indexer.pm lib/Pod/POM/Web/Indexer.pm --- /Users/chris/perl/lib/perl5/site_perl/Pod/POM/Web/Indexer.pm 2007-04-24 14:33:29.000000000 -0500 +++ lib/Pod/POM/Web/Indexer.pm 2007-04-26 01:33:17.000000000 -0500 @@ -146,11 +146,10 @@ $self->_tie_docs(DB_RDONLY); foreach my $id (@doc_ids) { - my ($mtime, $path) = split "\t", $self->{_docs}{$id}, 2; + my ($mtime, $path, $description) = split "\t", $self->{_docs}{$id}; my $score = $scores->{$id}; my @filenames = $self->find_source($path); my $buf = join "\n", map {$self->slurp_file($_)} @filenames; - my ($description) = ($buf =~ /^=head1\s*NAME\s*(.*)$/m); my $excerpts = $indexer->excerpts($buf, $regex); foreach (@$excerpts) { @@ -210,13 +209,23 @@ length($search_string) >= 2 or die "module_list: arg too short"; my $regex = qr/^\d+\t\Q$search_string\E/i; - my @names = grep {/$regex/} values %{$self->{_docs}}; - s[^\d+\t][], s[/][::]g foreach @names; + my @names = map {[split "\t", $_]->[1]} grep {/$regex/} values %{$self->{_docs}}; + s[/][::]g foreach @names; my $json_names = "[" . join(",", map {qq{"$_"}} sort @names) . "]"; return $self->send_content({content => $json_names, mime_type => 'application/x-json'}); } +sub get_abstract { # override from Web.pm + my ($self, $path) = @_; + if (!$self->{_path_to_descr}) { + $self->_tie_docs(DB_RDONLY); + $self->{_path_to_descr} = { map {(split /\t/, $_)[1,2]} values %{$self->{_docs}} }; + } + my $description = $self->{_path_to_descr}->{$path} || return; + (my $abstract = $description) =~ s/^.*?-\s*//; + return $abstract; +} #---------------------------------------------------------------------- # INDEXING @@ -244,7 +253,7 @@ $self->{_max_size_for_indexing} = $options{-max_size} || $defaut_max_size_for_indexing; - # tie to docs.bdb, storing {$doc_id => "$mtime\t$pathname"} + # tie to docs.bdb, storing {$doc_id => "$mtime\t$pathname\t$description"} $self->_tie_docs(DB_CREATE); # build in-memory reverse index of info contained in %{$self->{_docs}} @@ -252,8 +261,8 @@ $self->{_previous_index} = {}; while (my ($id, $doc_descr) = each %{$self->{_docs}}) { $self->{_max_doc_id} = max($id, $self->{_max_doc_id}); - my ($mtime, $path) = split /\t/, $doc_descr; - $self->{_previous_index}{$path} = {id => $id, mtime => $mtime}; + my ($mtime, $path, $description) = split /\t/, $doc_descr; + $self->{_previous_index}{$path} = {id => $id, mtime => $mtime, description => $description}; } # open the index @@ -338,6 +347,9 @@ my $t0 = time; my $buf = join "\n", map {$self->slurp_file($_)} @filenames; + my ($description) = ($buf =~ /^=head1\s*NAME\s*(.*)$/m); + $description ||= ''; + $description =~ s/\t/ /g; $buf =~ s/^=head1\s+($ignore_headings).*$//m; # remove full line of those $buf =~ s/^=(head\d|item)//mg; # just remove command of =head* or =item $buf =~ s/^=\w.*//mg; # remove full line of all other commands @@ -345,7 +357,7 @@ my $interval = time - $t0; printf STDERR "%0.3f s.", $interval; - $self->{_docs}{$doc_id} = "$mtime\t$fullpath"; + $self->{_docs}{$doc_id} = "$mtime\t$fullpath\t$description"; } print STDERR "\n"; @@ -360,7 +372,7 @@ sub _tie_docs { my ($self, $mode) = @_; - # tie to docs.bdb, storing {$doc_id => "$mtime\t$pathname"} + # tie to docs.bdb, storing {$doc_id => "$mtime\t$pathname\t$description"} tie %{$self->{_docs}}, 'BerkeleyDB::Hash', -Filename => "$index_dir/docs.bdb", -Flags => $mode diff -ur /Users/chris/perl/lib/perl5/site_perl/Pod/POM/Web.pm lib/Pod/POM/Web.pm --- /Users/chris/perl/lib/perl5/site_perl/Pod/POM/Web.pm 2007-04-24 15:16:17.000000000 -0500 +++ lib/Pod/POM/Web.pm 2007-04-26 01:29:29.000000000 -0500 @@ -430,11 +430,15 @@ $args{attrs} = qq{TN:contentURL='toc/$entry->{node}'}; } $args{href} = $entry->{node} if $entry->{pod}; + $args{abstract} = $self->get_abstract($entry->{node}) if $entry->{pod}; $html .= generic_node(%args); } return $html; } +sub get_abstract { + # override in indexer +} sub wrap_main_toc { my ($self, $perldocs, $pragmas, $modules) = @_; @@ -754,6 +758,9 @@ my ($default_label_tag, $label_attrs) = $args{href} ? ("a", qq{ href='$args{href}'}) : ("span", "" ); + if ($args{abstract}) { + $label_attrs .= qq{ title="$args{abstract}"}; + } $args{label_tag} ||= $default_label_tag; $args{label_class} ||= "TN_label"; return qq{<div class="$args{class}"$args{attrs}>}

Thu Apr 26 02:38:03 2007 chris+rt [...] chrisdolan.net - Correspondence added

Oops, wrong patch uploaded. This is the right one. -- Chris On Thu Apr 26 02:35:19 2007, CDOLAN wrote: Show quoted text

> > That makes a lot of sense. I implemented your idea in the attached > patch. The altered code now stores the POD description in the docs > database as a third column. When building the TOC, that DB record is > retrieved and the abstract (if present) is set as the title attribute. > The slowdown is negligible, both for TOC generation and indexing.

diff -ur /Users/chris/perl/lib/perl5/site_perl/Pod/POM/Web/Indexer.pm lib/Pod/POM/Web/Indexer.pm --- /Users/chris/perl/lib/perl5/site_perl/Pod/POM/Web/Indexer.pm 2007-04-24 14:33:29.000000000 -0500 +++ lib/Pod/POM/Web/Indexer.pm 2007-04-26 01:33:17.000000000 -0500 @@ -146,11 +146,10 @@ $self->_tie_docs(DB_RDONLY); foreach my $id (@doc_ids) { - my ($mtime, $path) = split "\t", $self->{_docs}{$id}, 2; + my ($mtime, $path, $description) = split "\t", $self->{_docs}{$id}; my $score = $scores->{$id}; my @filenames = $self->find_source($path); my $buf = join "\n", map {$self->slurp_file($_)} @filenames; - my ($description) = ($buf =~ /^=head1\s*NAME\s*(.*)$/m); my $excerpts = $indexer->excerpts($buf, $regex); foreach (@$excerpts) { @@ -210,13 +209,23 @@ length($search_string) >= 2 or die "module_list: arg too short"; my $regex = qr/^\d+\t\Q$search_string\E/i; - my @names = grep {/$regex/} values %{$self->{_docs}}; - s[^\d+\t][], s[/][::]g foreach @names; + my @names = map {[split "\t", $_]->[1]} grep {/$regex/} values %{$self->{_docs}}; + s[/][::]g foreach @names; my $json_names = "[" . join(",", map {qq{"$_"}} sort @names) . "]"; return $self->send_content({content => $json_names, mime_type => 'application/x-json'}); } +sub get_abstract { # override from Web.pm + my ($self, $path) = @_; + if (!$self->{_path_to_descr}) { + $self->_tie_docs(DB_RDONLY); + $self->{_path_to_descr} = { map {(split /\t/, $_)[1,2]} values %{$self->{_docs}} }; + } + my $description = $self->{_path_to_descr}->{$path} || return; + (my $abstract = $description) =~ s/^.*?-\s*//; + return $abstract; +} #---------------------------------------------------------------------- # INDEXING @@ -244,7 +253,7 @@ $self->{_max_size_for_indexing} = $options{-max_size} || $defaut_max_size_for_indexing; - # tie to docs.bdb, storing {$doc_id => "$mtime\t$pathname"} + # tie to docs.bdb, storing {$doc_id => "$mtime\t$pathname\t$description"} $self->_tie_docs(DB_CREATE); # build in-memory reverse index of info contained in %{$self->{_docs}} @@ -252,8 +261,8 @@ $self->{_previous_index} = {}; while (my ($id, $doc_descr) = each %{$self->{_docs}}) { $self->{_max_doc_id} = max($id, $self->{_max_doc_id}); - my ($mtime, $path) = split /\t/, $doc_descr; - $self->{_previous_index}{$path} = {id => $id, mtime => $mtime}; + my ($mtime, $path, $description) = split /\t/, $doc_descr; + $self->{_previous_index}{$path} = {id => $id, mtime => $mtime, description => $description}; } # open the index @@ -338,6 +347,9 @@ my $t0 = time; my $buf = join "\n", map {$self->slurp_file($_)} @filenames; + my ($description) = ($buf =~ /^=head1\s*NAME\s*(.*)$/m); + $description ||= ''; + $description =~ s/\t/ /g; $buf =~ s/^=head1\s+($ignore_headings).*$//m; # remove full line of those $buf =~ s/^=(head\d|item)//mg; # just remove command of =head* or =item $buf =~ s/^=\w.*//mg; # remove full line of all other commands @@ -345,7 +357,7 @@ my $interval = time - $t0; printf STDERR "%0.3f s.", $interval; - $self->{_docs}{$doc_id} = "$mtime\t$fullpath"; + $self->{_docs}{$doc_id} = "$mtime\t$fullpath\t$description"; } print STDERR "\n"; @@ -360,7 +372,7 @@ sub _tie_docs { my ($self, $mode) = @_; - # tie to docs.bdb, storing {$doc_id => "$mtime\t$pathname"} + # tie to docs.bdb, storing {$doc_id => "$mtime\t$pathname\t$description"} tie %{$self->{_docs}}, 'BerkeleyDB::Hash', -Filename => "$index_dir/docs.bdb", -Flags => $mode diff -ur /Users/chris/perl/lib/perl5/site_perl/Pod/POM/Web.pm lib/Pod/POM/Web.pm --- /Users/chris/perl/lib/perl5/site_perl/Pod/POM/Web.pm 2007-04-24 15:16:17.000000000 -0500 +++ lib/Pod/POM/Web.pm 2007-04-26 01:36:34.000000000 -0500 @@ -430,11 +430,15 @@ $args{attrs} = qq{TN:contentURL='toc/$entry->{node}'}; } $args{href} = $entry->{node} if $entry->{pod}; + $args{abstract} = $self->get_abstract($entry->{node}) if $entry->{pod}; $html .= generic_node(%args); } return $html; } +sub get_abstract { + # override in indexer +} sub wrap_main_toc { my ($self, $perldocs, $pragmas, $modules) = @_; @@ -745,6 +749,8 @@ # generating GvaScript treeNavigator structure #---------------------------------------------------------------------- +my %esc = ('&' => '&', '<' => '<', '>' => '>', '"' => '"'); + sub generic_node { my %args = @_; $args{class} ||= "TN_node"; @@ -754,6 +760,10 @@ my ($default_label_tag, $label_attrs) = $args{href} ? ("a", qq{ href='$args{href}'}) : ("span", "" ); + if ($args{abstract}) { + $args{abstract} =~ s/([&<>"])/$esc{$1}/g; + $label_attrs .= qq{ title="$args{abstract}"}; + } $args{label_tag} ||= $default_label_tag; $args{label_class} ||= "TN_label"; return qq{<div class="$args{class}"$args{attrs}>}

Fri Apr 27 02:48:49 2007 DAMI [...] cpan.org - Correspondence added

implemented in 1.04, just released (together with a new feature : hyperlinks from source code to use'd modules)

Fri Apr 27 02:48:51 2007 DAMI [...] cpan.org - Status changed from 'open' to 'resolved'