Corrupted files on whosonfirst.mapzen.com? #189

Closed
ong opened this issue Jan 25, 2017 · 8 comments

Comments

@ong

ong commented Jan 25, 2017

Running the download script results in the following corrupted file error on multiple bundles:

curl https://whosonfirst.mapzen.com/bundles/wof-region-latest-bundle.tar.bz2 | tar -xj --strip-components=1 --exclude=README.txt -C ...

curl: (56) GnuTLS recv error (-54): Error in the pull function.

bzip2: Compressed file ends unexpectedly;
	perhaps it is corrupted?  *Possible* reason follows.
bzip2: Inappropriate ioctl for device
	Input file = (stdin), output file = (stdout)

It is possible that the compressed file(s) have become corrupted.
You can use the -tvv option to test integrity of such files.

You can use the `bzip2recover' program to attempt to recover
data from undamaged sections of corrupted files.

tar: Unexpected EOF in archive
tar: Unexpected EOF in archive
tar: Error is not recoverable: exiting now
@ong
Author

ong commented Jan 25, 2017

Error disappears upon rerunning download script.

ong closed this as completed Jan 25, 2017
@trescube
Contributor

Sorry for the late reply, but if you encounter further issues with the data or data bundles, please report them to https://github.com/whosonfirst-data. This repository is just the Pelias importer of Who's on First data.

@orangejulius
Member

@thisisaaronland might be interested in seeing this even though it's now fixed.

@thisisaaronland

thisisaaronland commented Jan 25, 2017

This appears to be a TLS issue rather than a data issue?

@ong
Author

ong commented Jan 25, 2017

The only difference I could tell between the unsuccessful run and the successful one is that the successful one had a faster network connection and thus finished downloading faster. Could this be a timeout issue followed by the script attempting to uncompress a partially downloaded file?
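If the cause really is a dropped connection feeding a partial stream into `tar`, one way to rule it out is to stop piping `curl` straight into `tar`: download to a file, test the archive, and only then extract. The following is a minimal sketch under that assumption, not what the download script actually does. The function names `fetch_bundle` and `extract_verified`, the retry count, and the destination path in the usage comment are all hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: fetch a bundle to disk, verify it, then extract.
# Function names and the retry count are hypothetical, not part of the importer.

# Download URL $1 to file $2. --fail turns HTTP errors into a non-zero exit,
# --retry re-attempts errors that curl considers transient.
fetch_bundle() {
  curl --fail --silent --show-error --location --retry 3 --output "$2" "$1"
}

# Extract archive $1 into directory $2, but only if bzip2's integrity test passes.
# A truncated download fails here, before tar touches the destination.
extract_verified() {
  local archive="$1" dest="$2"
  if ! bzip2 -t "$archive" 2>/dev/null; then
    echo "archive is truncated or corrupt: $archive" >&2
    return 1
  fi
  mkdir -p "$dest"
  tar -xjf "$archive" --strip-components=1 --exclude=README.txt -C "$dest"
}

# Example usage (destination path is a placeholder):
#   tmp="$(mktemp)"
#   fetch_bundle "https://dist.whosonfirst.org/bundles/whosonfirst-data-locality-latest.tar.bz2" "$tmp" \
#     && extract_verified "$tmp" ./data/whosonfirst
#   rm -f "$tmp"
```

The trade-off is extra disk space for the temporary file, in exchange for a clear distinction between a failed transfer (non-zero exit from `curl`) and a bad archive (failed `bzip2 -t`).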

@jeremy-rutman

I've hit this on the same file twice now:

error downloading whosonfirst-data-locality-latest.tar.bz2 bundle: Error: Command failed: curl -s https://dist.whosonfirst.org/bundles/whosonfirst-data-locality-latest.tar.bz2 | tar -xj --strip-components=1 --exclude=README.txt -C /mnt/storage-proc2/users/jeremy/pelias_metal/data/whosonfirst && mv /mnt/storage-proc2/users/jeremy/pelias_metal/data/whosonfirst/whosonfirst-data-locality-latest.csv /mnt/storage-proc2/users/jeremy/pelias_metal/data/whosonfirst/meta

bzip2: Compressed file ends unexpectedly;
        perhaps it is corrupted?  *Possible* reason follows.
bzip2: Inappropriate ioctl for device
        Input file = (stdin), output file = (stdout)

It is possible that the compressed file(s) have become corrupted.
You can use the -tvv option to test integrity of such files.

You can use the `bzip2recover' program to attempt to recover
data from undamaged sections of corrupted files.

tar: Unexpected EOF in archive
tar: Unexpected EOF in archive
tar: Error is not recoverable: exiting now

@missinglink
Member

We don't have any control over the WhosOnFirst servers, please report the issue with the data provider here: https://github.com/whosonfirst-data/whosonfirst-data

@jeremy-rutman

Possibly a long untar leads to this error? I added a bit at 405

6 participants