Subject: | File names break extraction |
When I attempt to unpack this tarball, 'tar' fails on those files whose
names contain non-ASCII characters:
$ tar xzvf Lingua-DE-ASCII-0.11.tar.gz
Lingua-DE-ASCII-0.11/
Lingua-DE-ASCII-0.11/Changes
Lingua-DE-ASCII-0.11/t/
Lingua-DE-ASCII-0.11/t/words_with_ss.dat
Lingua-DE-ASCII-0.11/t/english.dat
Lingua-DE-ASCII-0.11/t/words_with_ae.dat
Lingua-DE-ASCII-0.11/t/words_with_\337.dat
tar: Lingua-DE-ASCII-0.11/t/words_with_\337.dat: Cannot open: Invalid
argument
Lingua-DE-ASCII-0.11/t/words_with_\344.dat
tar: Lingua-DE-ASCII-0.11/t/words_with_\344.dat: Cannot open: Invalid
argument
Lingua-DE-ASCII-0.11/t/to_ascii.t
Lingua-DE-ASCII-0.11/t/words_with_ue.dat
Lingua-DE-ASCII-0.11/t/words_with_oe.dat
Lingua-DE-ASCII-0.11/t/words_with_\366.dat
tar: Lingua-DE-ASCII-0.11/t/words_with_\366.dat: Cannot open: Invalid
argument
Lingua-DE-ASCII-0.11/t/only_ascii.t
Lingua-DE-ASCII-0.11/t/to_latin1.t
Lingua-DE-ASCII-0.11/t/1.t
Lingua-DE-ASCII-0.11/t/words_with_umlaut_and_\337.t
tar: Lingua-DE-ASCII-0.11/t/words_with_umlaut_and_\337.t: Cannot open:
Invalid argument
Lingua-DE-ASCII-0.11/t/words_with_foreign.dat
Lingua-DE-ASCII-0.11/t/special_characters.t
Lingua-DE-ASCII-0.11/t/words_with_\374.dat
tar: Lingua-DE-ASCII-0.11/t/words_with_\374.dat: Cannot open: Invalid
argument
Lingua-DE-ASCII-0.11/MANIFEST
Lingua-DE-ASCII-0.11/ASCII.pm
Lingua-DE-ASCII-0.11/README
Lingua-DE-ASCII-0.11/Makefile.PL
tar: Error exit delayed from previous errors
This problem also occurs when the 'tar' is wrapped inside larger
functionality such as core module Archive::Extract.
I (and others) are developing functionality to traverse a minicpan
repository and examine all distributions via Archive::Extract. The only
way I can see to get that functionality to work properly is to put
distributions that fail to extract properly on a list of distributions
to be skipped. This may render the distribution less useful in the future.
Can you help?
Thank you very much.
Jim Keenan