Subject: | Patch for using -l <language> option of tesseract |
Date: | Fri, 20 Feb 2009 13:17:34 +0100 |
To: | bug-Image-OCR-Tesseract [...] rt.cpan.org |
From: | Daniel Beuchler <db [...] arceo.de> |
Hi .*
this in not really a bugfix but a Patch to use
the -l option of tesseract. Newer Versions
let the user choose the language of the input
text.
Hope it is useful.
Cheers
Daniel
------8<---SNIP diff output
32c32
< my ($abs_image,$abs_tmp_dir, $lang )= @_;
---
Show quoted text
> my ($abs_image,$abs_tmp_dir )= @_;
57c57
< _tesseract($tmp_tif, $lang) || '';
---
Show quoted text > _tesseract($tmp_tif) || '';
77d76
< my $lang = shift;
79d77
< defined $lang or croak('missing language');
81c79
< system("$WHICH_TESSERACT '$abs_image' '$abs_image' -l $lang 2>/dev/
null"); # hard to check ==0
---
Show quoted text > system("$WHICH_TESSERACT '$abs_image' '$abs_image' 2>/dev/null");
# hard to check ==0