Skip Menu |

This queue is for tickets about the HTML-Parser CPAN distribution.

Report information
The Basics
Id: 42414
Status: resolved
Priority: 0/
Queue: HTML-Parser

People
Owner: Nobody in particular
Requestors: ville.skytta [...] iki.fi
Cc:
AdminCc:

Bug Information
Severity: Unimportant
Broken in: 3.59
Fixed in: (no value)



Subject: [PATCH] Spelling fixes
Spelling fixes.
Subject: 0001-Spelling-fixes.patch
From da82777ec449646df0cedf1898dd2fd28a0f0a19 Mon Sep 17 00:00:00 2001 From: =?utf-8?q?Ville=20Skytt=C3=A4?= <ville.skytta@iki.fi> Date: Wed, 14 Jan 2009 20:42:18 +0200 Subject: [PATCH] Spelling fixes. --- Changes | 16 ++++++++-------- Parser.pm | 6 +++--- TODO | 2 +- lib/HTML/Entities.pm | 2 +- lib/HTML/HeadParser.pm | 2 +- lib/HTML/LinkExtor.pm | 4 ++-- 6 files changed, 16 insertions(+), 16 deletions(-) diff --git a/Changes b/Changes index 531f83c..82b5bf9 100644 --- a/Changes +++ b/Changes @@ -41,7 +41,7 @@ Release 3.56 - Cloning of parser state for compatiblity with threads. + Cloning of parser state for compatibility with threads. Fixed by Bo Lindbergh <blgl@hagernas.com>. Don't require whitespace between declaration tokens. @@ -56,7 +56,7 @@ Treat <> at the end of document as text. Used to be reported as a comment. - Improved Firefox compatiblity for bad HTML: + Improved Firefox compatibility for bad HTML: - Unclosed <script>, <style> are now treated as empty tags. - Unclosed <textarea>, <xmp> and <plaintext> treat rest as text. - Unclosed <title> closes at next tag. @@ -257,7 +257,7 @@ garbage with older versions of perl. Emit warning if entities are decoded and something in the first - chunk looks like hibit UTF-8. Previously this warning was only + chunk looks like hi-bit UTF-8. Previously this warning was only triggered for documents with BOM. @@ -344,7 +344,7 @@ Release 3.37 Improved handling of HTML encoded surrogate pairs and illegally - endoded Unicode; <http://rt.cpan.org/Ticket/Display.html?id=7785>. + encoded Unicode; <http://rt.cpan.org/Ticket/Display.html?id=7785>. Patch by John Gardiner Myers <jgmyers@proofpoint.com>. Avoid generating bad UTF8 strings when decoding entities @@ -599,7 +599,7 @@ Release 3.21 - Fix a memory leak which occured when using filter methods. + Fix a memory leak which occurred when using filter methods. Avoid a few compiler warnings (DEC C): - Trailing comma found in enumerator list @@ -1274,7 +1274,7 @@ Faster HTML::LinkExtor by taking advantage of the new callback interface. The module now also uses URI.pm (instead - of the old URI::URL) to do URI-absolutations. + of the old URI::URL) to absolutize URIs. Faster HTML::TokeParser by taking advantage of new accum interface. @@ -1408,7 +1408,7 @@ instead of raising an exception, and strings like "*STDIN" are not treated as globs any more. - HTML::LinkExtor knowns about background attribute of <tables>. + HTML::LinkExtor knows about background attribute of <tables>. Patch by Clinton Wong <clintdw@netcom.com> HTML::TokeParser will parse large inline strings much faster now. @@ -1491,7 +1491,7 @@ Release 2.16 - The HTML::Parser could some times break hex entites (like &#xFFFF;) + The HTML::Parser could some times break hex entities (like &#xFFFF;) in the middle. Removed remaining forced dependencies on libwww-perl modules. It diff --git a/Parser.pm b/Parser.pm index 13d8100..3a8040a 100644 --- a/Parser.pm +++ b/Parser.pm @@ -334,8 +334,8 @@ entities alone. =item $p->backquote( $bool ) By default, only ' and " are recognized as quote characters around -attribute values. MSIE also recognize backquotes for some reason. -Enabling this attribute provide compatiblity with this behaviour. +attribute values. MSIE also recognizes backquotes for some reason. +Enabling this attribute provides compatibility with this behaviour. =item $p->boolean_attribute_value( $val ) @@ -1200,7 +1200,7 @@ has been opened in ":utf8" mode. The parser can process raw undecoded UTF-8 sanely if the C<utf8_mode> is enabled or if the "attr", "@attr" or "dtext" argspecs is avoided. -=item Parsing string decoded with wrong endianess +=item Parsing string decoded with wrong endianness (W) The first character in the document is U+FFFE. This is not a legal Unicode character but a byte swapped BOM. The result of parsing diff --git a/TODO b/TODO index e3e5290..4429562 100644 --- a/TODO +++ b/TODO @@ -2,7 +2,7 @@ TODO - limit the length of markup elements that never end. Perhaps by configurable limits on the length that markup can have and still - be recongnized. Report stuff as 'text' when this happens? + be recognized. Report stuff as 'text' when this happens? - remove 255 char limit on literal argspec strings - implement backslash escapes in literal argspec string - <![%app1;[...]]> (parameter entities) diff --git a/lib/HTML/Entities.pm b/lib/HTML/Entities.pm index 3d77d56..a066489 100644 --- a/lib/HTML/Entities.pm +++ b/lib/HTML/Entities.pm @@ -57,7 +57,7 @@ values are what they should expand into. The values do not have to be single character strings. If a key has ";" as suffix, then occurrences in $string are only expanded if properly terminated with ";". Entities without ";" will be expanded regardless of how -they are terminated for compatiblity with how common browsers treat +they are terminated for compatibility with how common browsers treat entities in the Latin-1 range. If $expand_prefix is TRUE then entities without trailing ";" in diff --git a/lib/HTML/HeadParser.pm b/lib/HTML/HeadParser.pm index 0a39c8c..eed15c1 100644 --- a/lib/HTML/HeadParser.pm +++ b/lib/HTML/HeadParser.pm @@ -85,7 +85,7 @@ $VERSION = "3.59"; The object constructor. The optional $header argument should be a reference to an object that implement the header() and push_header() methods as defined by the C<HTTP::Headers> class. Normally it will be -of some class that isa or delegates to the C<HTTP::Headers> class. +of some class that is a or delegates to the C<HTTP::Headers> class. If no $header is given C<HTML::HeadParser> will create an C<HTTP::Header> object by itself (initially empty). diff --git a/lib/HTML/LinkExtor.pm b/lib/HTML/LinkExtor.pm index b5ba20e..6285959 100644 --- a/lib/HTML/LinkExtor.pm +++ b/lib/HTML/LinkExtor.pm @@ -104,7 +104,7 @@ sub _found_link =item $p->links Returns a list of all links found in the document. The returned -values will be anonymous arrays with the follwing elements: +values will be anonymous arrays with the following elements: [$tag, $attr => $url1, $attr2 => $url2,...] @@ -155,7 +155,7 @@ received using LWP: } # Make the parser. Unfortunately, we don't know the base yet - # (it might be diffent from $url) + # (it might be different from $url) $p = HTML::LinkExtor->new(\&callback); # Request document and parse it as it arrives -- 1.5.6.6
Thanks. Patch applied.