Subject: | Automatic RSID marks bug in Libreoffice - Work around? |
Date: | Mon, 6 Jun 2016 08:17:37 +0200 (CEST) |
To: | bug-ODF-lpOD [...] rt.cpan.org |
From: | Guido Brugnara <gdo [...] leader.it> |
We use "serialize" method to extract sections in XML to render with DTL::Fast parser [https://metacpan.org/pod/DTL::Fast].
But I have encountered this problem:
in document I read: "{{ MyHash.MyVar }}" but in content.xml there is:
"<text:p text:style-name="P9">{{ <text:span text:style-name="T5">MyHash</text:span>.MyVar }}</text:p>"
In the document the text appears uniform, but a part is marked with a TAG "text:span", consequently the DTL parser not recognize the "{{...}}" tag.
Unfortunately, ODT documents generated with Libreoffice suffer from a bug that causes the proliferation of SPAN elements even if different formatting attributes are not included, compared to the adjacent text.
The problem is documented here:
https://bugs.documentfoundation.org/show_bug.cgi?id=68183&redirected_from=fdo
The following XML code shows the style:style content:
"<style:style style:name="T5" style:family="text"><style:text-properties officeooo:rsid="00260f63"/></style:style>"
Prior to applying the DTL parser, it is then necessary to verify if there are no text:span elements with style containing only officeooo:rsid properties which will be deleted prior to proceed with the parsing.
What method do you suggest to work around this problem?
bye
Gdo