
This queue is for tickets about the Math-String-Charset-Wordlist CPAN distribution.

Report information
The Basics
Id: 3262
Status: new
Priority: 0/
Queue: Math-String-Charset-Wordlist

People
Owner: Nobody in particular
Requestors: perl_dummy [...] bloodgate.com
Cc:
AdminCc:

Bug Information
Severity: Wishlist
Broken in: 0.05
Fixed in: (no value)



Subject: Dictionary size limited
The current code can only handle dictionaries up to a certain size because of memory restrictions. The reason is that it stores the offset of each record, and thus needs at least 4 bytes per line. Since dictionaries usually have many short lines, this limits the dictionary size quite a bit. This could hopefully be fixed by storing only every Nth offset and loading the N offsets that follow it on demand, keeping X buffers of N offsets each as a cache.
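
A minimal sketch of the proposed scheme, to make the idea concrete. It is written in Python for illustration only and is not the module's code; the class name `SparseWordlist`, its methods, and the parameters `step` (the N above) and `max_blocks` (the X above) are all hypothetical. It assumes the dictionary is a plain file with one word per line.

```python
from collections import OrderedDict


class SparseWordlist:
    """Random access to the lines of a wordlist via a sparse offset index."""

    def __init__(self, path, step=64, max_blocks=4):
        self.path = path
        self.step = step              # N: keep one offset per `step` lines
        self.max_blocks = max_blocks  # X: number of cached offset blocks
        self.checkpoints = []         # byte offsets of lines 0, N, 2N, ...
        self.count = 0                # total number of lines
        self._cache = OrderedDict()   # block number -> offsets of its lines
        with open(path, "rb") as fh:
            offset = 0
            for line in fh:
                if self.count % step == 0:
                    self.checkpoints.append(offset)
                offset += len(line)
                self.count += 1

    def _block(self, block_no):
        """Return the line offsets of one block, rescanning the file on a miss."""
        if block_no in self._cache:
            self._cache.move_to_end(block_no)   # mark as most recently used
            return self._cache[block_no]
        offsets = []
        pos = self.checkpoints[block_no]
        with open(self.path, "rb") as fh:
            fh.seek(pos)
            for _ in range(self.step):
                line = fh.readline()
                if not line:                    # last block may be short
                    break
                offsets.append(pos)
                pos += len(line)
        self._cache[block_no] = offsets
        if len(self._cache) > self.max_blocks:
            self._cache.popitem(last=False)     # evict least recently used
        return offsets

    def word(self, n):
        """Return the n-th word (0-based) of the dictionary."""
        if not 0 <= n < self.count:
            raise IndexError(n)
        block_no, within = divmod(n, self.step)
        offset = self._block(block_no)[within]
        with open(self.path, "rb") as fh:
            fh.seek(offset)
            return fh.readline().rstrip(b"\r\n").decode("utf-8")
```

With this layout the resident index holds roughly one offset per N lines plus at most X * N cached offsets, at the cost of rescanning up to N lines whenever a lookup misses the cache.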