Subject: | langident can't identify languages on big files (like 100 MB corpora) |
angident can't identify languages on big files (like 100 MB corpora). Different ways can be implemented to solve the problem (head, tail, mixup) but the most important at the moment is to get one of the methods working.