
This queue is for tickets about the Lingua-DE-ASCII CPAN distribution.

Report information
The Basics
Id: 55044
Status: resolved
Priority: 0/
Queue: Lingua-DE-ASCII

People
Owner: Nobody in particular
Requestors: jkeen [...] verizon.net
Cc:
AdminCc:

Bug Information
Severity: Critical
Broken in: (no value)
Fixed in: (no value)



Subject: File names break extraction
When I attempt to unpack this tarball, 'tar' fails on those files whose names contain non-ASCII characters:

$ tar xzvf Lingua-DE-ASCII-0.11.tar.gz
Lingua-DE-ASCII-0.11/
Lingua-DE-ASCII-0.11/Changes
Lingua-DE-ASCII-0.11/t/
Lingua-DE-ASCII-0.11/t/words_with_ss.dat
Lingua-DE-ASCII-0.11/t/english.dat
Lingua-DE-ASCII-0.11/t/words_with_ae.dat
Lingua-DE-ASCII-0.11/t/words_with_\337.dat
tar: Lingua-DE-ASCII-0.11/t/words_with_\337.dat: Cannot open: Invalid argument
Lingua-DE-ASCII-0.11/t/words_with_\344.dat
tar: Lingua-DE-ASCII-0.11/t/words_with_\344.dat: Cannot open: Invalid argument
Lingua-DE-ASCII-0.11/t/to_ascii.t
Lingua-DE-ASCII-0.11/t/words_with_ue.dat
Lingua-DE-ASCII-0.11/t/words_with_oe.dat
Lingua-DE-ASCII-0.11/t/words_with_\366.dat
tar: Lingua-DE-ASCII-0.11/t/words_with_\366.dat: Cannot open: Invalid argument
Lingua-DE-ASCII-0.11/t/only_ascii.t
Lingua-DE-ASCII-0.11/t/to_latin1.t
Lingua-DE-ASCII-0.11/t/1.t
Lingua-DE-ASCII-0.11/t/words_with_umlaut_and_\337.t
tar: Lingua-DE-ASCII-0.11/t/words_with_umlaut_and_\337.t: Cannot open: Invalid argument
Lingua-DE-ASCII-0.11/t/words_with_foreign.dat
Lingua-DE-ASCII-0.11/t/special_characters.t
Lingua-DE-ASCII-0.11/t/words_with_\374.dat
tar: Lingua-DE-ASCII-0.11/t/words_with_\374.dat: Cannot open: Invalid argument
Lingua-DE-ASCII-0.11/MANIFEST
Lingua-DE-ASCII-0.11/ASCII.pm
Lingua-DE-ASCII-0.11/README
Lingua-DE-ASCII-0.11/Makefile.PL
tar: Error exit delayed from previous errors

This problem also occurs when the 'tar' is wrapped inside larger functionality such as core module Archive::Extract.

I (and others) are developing functionality to traverse a minicpan repository and examine all distributions via Archive::Extract. The only way I can see to get that functionality to work properly is to put distributions that fail to extract properly on a list of distributions to be skipped. This may render the distribution less useful in the future.

Can you help?

Thank you very much.
Jim Keenan
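The skip-list idea described above could be sketched roughly as follows. This is only an illustration in Python's standard `tarfile` module, not the Archive::Extract-based tooling the report refers to. The helper names (`make_demo_tarball`, `non_ascii_members`, `build_skip_list`) and the `demo-0.01` archive are hypothetical, and the sketch assumes the member names are stored as raw Latin-1 bytes.

```python
import io
import tarfile


def make_demo_tarball() -> bytes:
    """Build a small in-memory .tar.gz with one non-ASCII member name."""
    buf = io.BytesIO()
    # Assumption: GNU format with encoding="latin-1" stores the name as a
    # raw Latin-1 byte, similar to the names shown in the report.
    with tarfile.open(fileobj=buf, mode="w:gz",
                      format=tarfile.GNU_FORMAT, encoding="latin-1") as tar:
        for name in ("demo-0.01/t/words_with_ss.dat",
                     "demo-0.01/t/words_with_\xdf.dat"):
            data = b"placeholder\n"
            info = tarfile.TarInfo(name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def non_ascii_members(tar_bytes: bytes) -> list:
    """Return the member names that contain non-ASCII characters."""
    # Reading with encoding="latin-1" maps every byte to one character,
    # so the names can be inspected without extracting anything.
    with tarfile.open(fileobj=io.BytesIO(tar_bytes), mode="r:*",
                      encoding="latin-1") as tar:
        return [m.name for m in tar.getmembers() if not m.name.isascii()]


def build_skip_list(archives: dict) -> list:
    """Map of archive name -> bytes in, sorted list of names to skip out."""
    return sorted(dist for dist, data in archives.items()
                  if non_ascii_members(data))


if __name__ == "__main__":
    demo = make_demo_tarball()
    print(non_ascii_members(demo))
    print(build_skip_list({"demo-0.01.tar.gz": demo}))
```

Listing the names first, without extracting, lets a traversal decide up front which distributions to skip instead of relying on a failed extraction.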
Hello,

sorry for answering this late, but I used to work on something completely unrelated to programming, so I didn't notice it earlier.

On Sat 27 Feb 2010, 10:46:19, jkeen@verizon.net wrote:
> When I attempt to unpack this tarball, 'tar' fails on those files whose
> names contain non-ASCII characters:
> [...]
> Can you help?
I can't reproduce the problem; tar works fine for me. Maybe you can give me a Dockerfile that reproduces the problem nowadays.

Also, the job of Lingua::DE::ASCII is to transform texts from ANSI to ASCII and vice versa, so neither the core module nor the tests can avoid using ANSI encoding inside. That is the purpose of this CPAN module and not a bug. I could imagine it's more a problem with tar here (but as this bug request is years old, it is no longer reproducible for me).

If I don't hear anything from you, I'll just close this bug request.

Greetings,
Janek
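For readers puzzled by the `\337`-style names in the tar listing: these are octal escapes for single bytes. A minimal sketch for decoding them follows, assuming the bytes are Latin-1 (an assumption suggested by the `t/to_latin1.t` test name, not confirmed in this ticket). The function name `decode_octal_escapes` is hypothetical.

```python
import re


def decode_octal_escapes(name: str) -> str:
    """Replace \\NNN octal escapes (000-377) with Latin-1 characters."""
    # Assumption: each escaped byte is a Latin-1 code point.
    return re.sub(
        r"\\([0-3][0-7]{2})",
        lambda m: bytes([int(m.group(1), 8)]).decode("latin-1"),
        name,
    )


if __name__ == "__main__":
    for escaped in (r"words_with_\337.dat", r"words_with_\344.dat",
                    r"words_with_\366.dat", r"words_with_\374.dat"):
        print(escaped, "->", decode_octal_escapes(escaped))
```

Under that assumption, `\337`, `\344`, `\366` and `\374` correspond to ß, ä, ö and ü, which matches the ASCII-transliterated file names (`ss`, `ae`, `oe`, `ue`) in the same directory.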
On Sun Aug 28 06:29:35 2016, BIGJ wrote:
> I can't reproduce the problem,
> tar works fine for me,
> maybe you can give me a Dockerfile that reproduces the problem nowadays.
>
> Also, the job of Lingua::DE::ASCII is to transform texts from ANSI
> to ASCII and vice versa, so neither the core module nor the tests can
> avoid using ANSI encoding inside. It's the purpose of this CPAN module
> and not a bug. I could imagine it's more a problem of tar here (but as
> this bug request is years old, not reproducible any longer for me).
>
> If I don't hear anything from you,
> I'll just close this bug request.
Yes, you can close this bug request. I suspect this was more a problem with the version of 'tar' I was using than with your distribution.

Originally, I was using GNU tar 1.14 on an older Mac OS X 10.4. Today I tried the extract on these systems:

Ubuntu Linux 16.04 LTS; GNU tar 1.28
FreeBSD 10.3; bsdtar 3.1.2

In both cases, the extraction was successful.

Thank you very much.
James Keenan