Skip Menu |

Preferred bug tracker

Please visit the preferred bug tracker to report your issue.

This queue is for tickets about the Image-WordCloud CPAN distribution.

Report information
The Basics
Id: 75766
Status: open
Priority: 0/
Queue: Image-WordCloud

People
Owner: brianhann [...] gmail.com
Requestors: sukria [...] sukria.net
Cc:
AdminCc:

Bug Information
Severity: Important
Broken in: 0.01
Fixed in: (no value)



Subject: Words with UTF8 are truncated
Hi, First of all, thanks a lot for this module, it was exactly what I was looking for! It's really great. I have an issue though, because I'm mostly dealing with french content, and many words in french have accented characters, like "é" or "ç", or even "à". It appears that such words are truncated by Image::WordCloud, for instance, if the word "Président" appears in the cloud, it will be displayed as "Pr" and "sident", the "é" letter is missing and the word is exploded in two parts. Any ideas why this is happening? I'm willing to help, if I can, tell me ;) Thanks again for the great module.
Hey, sorry for not seeing this sooner. I had moved everything over to github (https://github.com/c0bra/image-wordcloud-perl) but since I haven't released anything stable, the pod on CPAN still points to this tracker. I'm in the middle of trying to get a new stable release with proper word collision detection using hierarchical bounding boxes. Unfortunately I haven't had a lot of time to move forward. I'll look into this issue, though. I bet it's because you're passing in the list of words as a string and I'm doing some improper word splitting that eats UTF-8. Should hopefully be an easy fix. On Wed Mar 14 08:11:10 2012, SUKRIA wrote: Show quoted text
> Hi, > > First of all, thanks a lot for this module, it was exactly what I was > looking for! It's really great. > > I have an issue though, because I'm mostly dealing with french content, > and many words in french have accented characters, like "é" or "ç", or > even "à". It appears that such words are truncated by Image::WordCloud, > for instance, if the word "Président" appears in the cloud, it will be > displayed as "Pr" and "sident", the "é" letter is missing and the word > is exploded in two parts. > > Any ideas why this is happening? > > I'm willing to help, if I can, tell me ;) > > Thanks again for the great module.