-
Notifications
You must be signed in to change notification settings - Fork 2.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ZEPPELIN-6157] Download artifacts from CDN if available #4901
Conversation
Error seems to be unrelated, happens in other runs: |
I have had the experience that download servers which return the lua script were not available. What is important to me is a stable download and a stable CI environment. Speed is of secondary importance. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
## What changes were proposed in this pull request? Current artifacts available in CDN (`dlcdn.apache.org`) may get removed without notice when new releases appear. To avoid broken links, build scripts contain permanent addresses from `archive.apache.org`. But download from `archive.apache.org` may be slow: ``` Thu, 05 Dec 2024 08:39:53 GMT [INFO] --- download:1.6.0:wget (download-sparkr-files) <at> r --- Thu, 05 Dec 2024 08:39:54 GMT Warning: No signatures were supplied, skipping file validation Thu, 05 Dec 2024 08:39:54 GMT [INFO] Read Timeout is set to 60000 milliseconds (apprx 1 minutes) Thu, 05 Dec 2024 08:45:46 GMT [INFO] Expanding: /home/runner/work/zeppelin/zeppelin/rlang/target/spark-3.5.3-bin-without-hadoop.tgz into /home/runner/work/zeppelin/zeppelin/rlang/target ``` Apache Infra's [`closer.lua` script](https://infra.apache.org/release-download-pages.html#closer) can redirect to CDN or archive, depending on artifact availability. This change replaces `archive.apache.org` URLs, and one instance of `dist.apache.org`, with their `closer.lua` equivalent. Output filename has to be specified for `wget` unfortunately. https://issues.apache.org/jira/browse/ZEPPELIN-6157 ## How was this patch tested? Tried some of the URLs locally, both from CLI (`curl -L --head`) and regular build (`mvn -DskipTests clean package`). Full CI: - quick: https://github.com/adoroszlai/zeppelin/actions/runs/12319072153 - frontend: https://github.com/adoroszlai/zeppelin/actions/runs/12319072142 - core: https://github.com/adoroszlai/zeppelin/actions/runs/12319072156 Closes #4901 from adoroszlai/ZEPPELIN-6157. Signed-off-by: Philipp Dallig <[email protected]>
Merged into master/branch-0.12 |
Thanks @Reamer for reviewing and merging this. |
What changes were proposed in this pull request?
Current artifacts available in CDN (
dlcdn.apache.org
) may get removed without notice when new releases appear. To avoid broken links, build scripts contain permanent addresses fromarchive.apache.org
. But download fromarchive.apache.org
may be slow:Apache Infra's
closer.lua
script can redirect to CDN or archive, depending on artifact availability.This change replaces
archive.apache.org
URLs, and one instance ofdist.apache.org
, with theircloser.lua
equivalent. Output filename has to be specified forwget
unfortunately.https://issues.apache.org/jira/browse/ZEPPELIN-6157
How was this patch tested?
Tried some of the URLs locally, both from CLI (
curl -L --head
) and regular build (mvn -DskipTests clean package
).Full CI: