
This queue is for tickets about the RTF-Parser CPAN distribution.

Report information
The Basics
Id: 122
Status: new
Priority: 0/
Queue: RTF-Parser

People
Owner: stuart [...] morungos.com
Requestors: cfeied [...] ncemi.org
Cc:
AdminCc:

Bug Information
Severity: Important
Broken in: 1.07
Fixed in: (no value)



Subject: improper nesting of html attributes
Hi,

Two brief issues. The first relates to extra html end-attribute markers that make the html invalid. The second is the nesting of html attributes.

When I mark text in WordXP with any text attribute (e.g. strikeout) I get a lot of extra end-attribute markers from rtf2html.

Example: a single word marked as strikeout generated this rtf:

    {\strike\insrsid11875762\charrsid11875762 \hich\af0\dbch\af11\loch\f0 St\hich\af0\dbch\af11\loch\f0 r\hich\af0\dbch\af11\loch\f0 ikeout}{\strike\insrsid5449462\charrsid11875762 \par }{\insrsid11875762 \hich\af0\dbch\af11\loch\f0 H}{\sub\insrsid11875762\charrsid11875762 \hich\af0\dbch\af11\loch\f0 2}{\insrsid11875762 \hich\af0\dbch\af11\loch\f0 0 etc.

which then yields this html:

    <p ALIGN=LEFT><strike>Strikeout</strike></strike></p>
    <p ALIGN=LEFT></strike>H<sub>2</sub>0</p>
    etc.

This also occurs with bold, underline, italic, etc.

The second issue is that when the rtf contains {\i\b\ul Hello World} then the HTML generated is

    <i><b><u> Hello World</i></b></u>

which we think is perhaps not nested correctly.

Despite these minor problems, we thank you for your clear, functional, and very useful code!

-- Craig Feied
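To illustrate the nesting the second issue is asking for, here is a minimal Python sketch. It is not rtf2html's code: the `RTF_TO_HTML` table and the `wrap_with_tags` helper are hypothetical names made up for this example, and the sketch assumes the attributes of one RTF group arrive as a simple list of control words. The idea is only that tags opened in one order are closed in the reverse order, and that each group emits exactly one end tag per start tag.

```python
# Hypothetical mapping from RTF character-format control words to HTML tags.
# Control words without an entry (e.g. \insrsidN) produce no markup here.
RTF_TO_HTML = {"i": "i", "b": "b", "ul": "u", "strike": "strike", "sub": "sub"}


def wrap_with_tags(text, control_words):
    """Wrap text in the HTML tags for one RTF group, closing in reverse order."""
    tags = [RTF_TO_HTML[w] for w in control_words if w in RTF_TO_HTML]
    opening = "".join(f"<{t}>" for t in tags)
    # Closing in reverse order is what keeps the elements properly nested.
    closing = "".join(f"</{t}>" for t in reversed(tags))
    return f"{opening}{text}{closing}"


print(wrap_with_tags("Hello World", ["i", "b", "ul"]))
# prints <i><b><u>Hello World</u></b></i>
```

Because the end tags are derived from the same list as the start tags, a group such as the strikeout example cannot produce a second `</strike>`.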
From: cfeied [...] ncemi.org
[guest - Tue Jan 8 02:03:35 2002]: Oops -- The html below is now shown, rather than being rendered

Hi,

Two brief issues. The first relates to extra html end-attribute markers that make the html invalid. The second is the nesting of html attributes.

When I mark text in WordXP with any text attribute (e.g. strikeout) I get a lot of extra end-attribute markers from rtf2html.

Example: a single word marked as strikeout generated this rtf:

    {\strike\insrsid11875762\charrsid11875762 \hich\af0\dbch\af11\loch\f0 St\hich\af0\dbch\af11\loch\f0 r\hich\af0\dbch\af11\loch\f0 ikeout}{\strike\insrsid5449462\charrsid11875762 \par }{\insrsid11875762 \hich\af0\dbch\af11\loch\f0 H}{\sub\insrsid11875762\charrsid11875762 \hich\af0\dbch\af11\loch\f0 2}{\insrsid11875762 \hich\af0\dbch\af11\loch\f0 0 etc.

which then yields this html:

    <p ALIGN=LEFT><strike>Strikeout</strike></strike></p>
    <p ALIGN=LEFT></strike>H<sub>2</sub>0</p>
    etc.

This also occurs with bold, underline, italic, etc.

The second issue is that when the rtf contains {\i\b\ul Hello World} then the HTML generated is

    <i><b><u> Hello World</i></b></u>

which we think is perhaps not nested correctly.

Despite these minor problems, we thank you for your clear, functional, and very useful code!

-- Craig Feied
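For anyone wanting to reproduce the report, the two symptoms can be detected mechanically. The following is a rough Python sketch using the standard library's `html.parser`; the `NestingChecker` class and `check_nesting` function are names invented for this example, and the checker deliberately treats every start tag as needing an end tag, which is an assumption that fits the fragments in this ticket but not void elements such as `<br>`.

```python
from html.parser import HTMLParser


class NestingChecker(HTMLParser):
    """Track open tags on a stack and record end tags that do not match."""

    def __init__(self):
        super().__init__()
        self.open_tags = []   # stack of currently open tag names
        self.problems = []    # human-readable findings, in document order

    def handle_starttag(self, tag, attrs):
        self.open_tags.append(tag)

    def handle_endtag(self, tag):
        if self.open_tags and self.open_tags[-1] == tag:
            self.open_tags.pop()                      # well-nested close
        elif tag in self.open_tags:
            self.problems.append(f"misnested </{tag}>")
            # Remove the innermost matching open tag so later checks stay sane.
            idx = len(self.open_tags) - 1 - self.open_tags[::-1].index(tag)
            del self.open_tags[idx]
        else:
            self.problems.append(f"stray </{tag}>")   # nothing open to close


def check_nesting(fragment):
    """Return a list of nesting problems found in an HTML fragment."""
    checker = NestingChecker()
    checker.feed(fragment)
    checker.close()
    return checker.problems + [f"unclosed <{t}>" for t in checker.open_tags]


print(check_nesting("<p ALIGN=LEFT><strike>Strikeout</strike></strike></p>"))
print(check_nesting("<i><b><u> Hello World</i></b></u>"))
```

Run on the two fragments from the report, the first call flags the extra `</strike>` as stray and the second flags `</i>` and `</b>` as misnested.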