Bug #23954 for RTF-Parser: RTF::HTML::Converter kudos + fixes

Subject:	RTF::HTML::Converter kudos + fixes
Date:	Fri, 15 Dec 2006 21:50:13 -0600
To:	bug-RTF-Parser [...] rt.cpan.org
From:	Cynbe ru Taren <cynbe [...] muq.org>

FWIW, converting a Nisus RTF file, we found we needed to make a few additions to the tables: Charsets.pm # I added the following entries to all three # of the tables %ansi %mac %pc # since I didn't know which was/were the most # relevant: aafterfl 61 cafterfi 63 eafterfi 65 nafterfi 6e oafterfl 6f rafterfi 72 uafterfi 75 char_map # Here I added the matching entries: aafterfl a cafterfi c eafterfi e nafterfi n rafterfi r oafterfl o uafterfi u (Without the above, the output HTML file wound up being littered with '61' '65' etc, which is most unsightly.) It seems highly probable that the actual standard, where-ever it is, in fact defines a complete alphabetic sequence: aafterfl 61 bafterfi 62 cafterfi 63 dafterfi 64 eafterfi 65 fafterfi 66 gafterfi 67 hafterfi 68 iafterfi 69 jafterfi 6a kafterfi 6b lafterfi 6c mafterfi 6d nafterfi 6e oafterfl 6f pafterfl 70 qafterfl 71 rafterfi 72 safterfi 73 tafterfi 74 uafterfi 75 vafterfi 76 wafterfi 77 xafterfi 78 yafterfi 79 zafterfi 7a Or at the least, it is a curious coincidence that the ones empirically derived above fit perfectly into the above guesswork sequence. :) Presumably they are supposed to be kerned somewhat differently due to appearing immediately after a ligature. There may be an issue with treatment of tabs, too, I haven't looked at that in detail yet. Life is Good! -- Cynbe