[ZEPPELIN-5969] Remove Hadoop2 and move to Hadoop3 shaded client #4691
Conversation
  <version>${hadoop.version}</version>
  <scope>compile</scope>
</dependency>

<dependency>
<dependency>
  <groupId>org.apache.hadoop</groupId>
  <artifactId>hadoop-client-api</artifactId>
  <version>${hadoop.version}</version>
  <scope>compile</scope>
</dependency>
<dependency>
Without the dependency on hadoop-client-api, r.ir and r.shiny throw an exception upon execution: java.lang.NoClassDefFoundError: org/apache/hadoop/fs/FSDataInputStream.
Thanks for checking. I think hadoop-client-runtime will pull in hadoop-client-api, so there should be no problem.
This PR is not ready yet (I'm a little busy these days; I may be able to continue this work next week).
hadoop-client-api is a runtime dependency of hadoop-client-runtime, so maven-shade-plugin should include it. However, I have built the R interpreter package on my local machine, and without adding the dependency explicitly, hadoop-client-api is not present in the shaded jar. (But maybe I am doing something wrong.)
How do you know that the maven-shade plugin also adds runtime dependencies? After a first look, I think the hadoop-client-api dependency has to be added manually.
The description at https://maven.apache.org/plugins/maven-shade-plugin/shade-mojo.html says:
Requires dependency resolution of artifacts in scope: runtime
But I am really not sure how all dependencies are resolved.
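One way to take the guesswork out of transitive resolution is to list the Hadoop artifacts explicitly in the shade configuration. This is only a hedged sketch of a maven-shade-plugin setup, not Zeppelin's actual configuration:

```xml
<!-- Illustrative maven-shade-plugin configuration: explicitly include
     hadoop-client-api so the content of the shaded jar does not depend
     on how the plugin resolves runtime-scoped transitive dependencies. -->
<plugin>
  <groupId>org.apache.maven.plugins</groupId>
  <artifactId>maven-shade-plugin</artifactId>
  <configuration>
    <artifactSet>
      <includes>
        <include>org.apache.hadoop:hadoop-client-api</include>
        <include>org.apache.hadoop:hadoop-client-runtime</include>
      </includes>
    </artifactSet>
  </configuration>
</plugin>
```

Running `mvn dependency:tree -Dincludes=org.apache.hadoop` on the module would also show whether hadoop-client-api is resolved at all before shading.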
Alright, I know where the issue comes from.
Do we want to respect hadoop.deps.scope if we switch to the Hadoop shaded client?
Note: Spark dropped such profiles and always ships the Hadoop shaded client jars in its binary artifacts.
<hadoop.deps.scope>provided</hadoop.deps.scope>
...
<profile>
<id>include-hadoop</id>
<properties>
<hadoop.deps.scope>compile</hadoop.deps.scope>
</properties>
</profile>
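For illustration, a module would then consume the parent-defined property like this (a sketch, assuming the property is inherited from the parent pom):

```xml
<!-- Sketch: the include-hadoop profile flips hadoop.deps.scope from
     provided to compile, so this dependency is only bundled into the
     distribution when the profile is active. -->
<dependency>
  <groupId>org.apache.hadoop</groupId>
  <artifactId>hadoop-client-runtime</artifactId>
  <version>${hadoop.version}</version>
  <scope>${hadoop.deps.scope}</scope>
</dependency>
```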
The main reason for the introduction, I believe, was the YARN interpreter launch mode. See #3786
I don't use this setup myself. If I understand correctly, YARN has already loaded the Hadoop dependencies, so they don't need to be shipped with Zeppelin as well. Perhaps @zjffdu can say more about this.
I am of the opinion that we should include the shaded Hadoop dependency, as it is not available in Kubernetes or Docker, for example. However, we should make sure that we do not deliver the library more than once (e.g. additionally in plugins, see #3817).
If I understand correctly, YARN has already loaded the Hadoop dependencies, so they don't need to be shipped with Zeppelin as well.
There is a switch in YARN to enable/disable Hadoop class population for containers.
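For context (my understanding, not something this PR changes), the jars YARN puts on a container's classpath are governed by yarn-site.xml, for example:

```xml
<!-- Illustrative yarn-site.xml snippet: yarn.application.classpath
     controls which Hadoop directories YARN adds to container classpaths. -->
<property>
  <name>yarn.application.classpath</name>
  <value>$HADOOP_CONF_DIR,$HADOOP_COMMON_HOME/share/hadoop/common/*,$HADOOP_HDFS_HOME/share/hadoop/hdfs/*,$HADOOP_YARN_HOME/share/hadoop/yarn/*</value>
</property>
```

If I remember correctly, Spark additionally exposes spark.yarn.populateHadoopClasspath to toggle this behavior per application.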
we should make sure that we do not deliver the library more than once
QQ: I understand we should not include Hadoop classes in plugins, because they are loaded into the same JVM as the Zeppelin server and can therefore share its Hadoop classes. What about the interpreters? I assume the interpreters always run in dedicated JVMs, so Hadoop classes seem always necessary (except for those runtimes that already provide Hadoop classes, e.g. Spark, Flink)?
There is a switch in YARN to enable/disable Hadoop class population for containers.
I don't know how this is used in Zeppelin.
QQ: I understand we should not include Hadoop classes in plugins, because they are loaded into the same JVM as the Zeppelin server and can therefore share its Hadoop classes. What about the interpreters? I assume the interpreters always run in dedicated JVMs, so Hadoop classes seem always necessary (except for those runtimes that already provide Hadoop classes, e.g. Spark, Flink)?
Correct, the Zeppelin server and zengine run in the same JVM as the Zeppelin plugins.
In my opinion, the interpreters usually run in separate JVM instances. We should set the scope of Hadoop to provided in the interpreters, because I think the Hadoop code in the interpreters is only used for YARN. See
zeppelin/zeppelin-interpreter/src/main/java/org/apache/zeppelin/interpreter/remote/YarnUtils.java, line 20 (commit 56da029):
import org.apache.hadoop.conf.Configuration;
Maybe there will be a way to remove the dependency at some point.
I revisited this question. The rlang interpreter can work both independently and combined with SparkR. In the independent case, Hadoop deps are not required; in the SparkR case, the Hadoop jars are handled by the Spark runtime, either shipped in SPARK_HOME/jars or picked up via a properly set HADOOP_CLASSPATH. So the rlang interpreter doesn't need to ship Hadoop deps.
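The two SparkR-case options mentioned above amount to environment setup along these lines (illustrative paths, not from the PR):

```shell
# Option 1: rely on the Hadoop jars already shipped in SPARK_HOME/jars
export SPARK_HOME=/opt/spark   # hypothetical install location

# Option 2: expose an external Hadoop installation to the runtime
export HADOOP_CLASSPATH="$(hadoop classpath)"   # requires the hadoop CLI on PATH
```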
I have basically fixed all compile and test issues; the next step is to split this into several small PRs to speed up the review process. I think we should start with the interpreter modules one by one, then zengine, server and other modules, eventually dropping Hadoop2. @Reamer could you give some advice?
I would prefer a larger PR, with the individual tasks contained in separate commits. It was clear that dropping Hadoop2 would be very large. Thank you for your work so far. I think it's great that you have deleted all the excludes in the parent pom. Btw, I do not insist on co-authorship.
Unfortunately, I found that the IT does not run properly now, see #4699; we may need to postpone this PR until the IT is recovered.
<dependency>
  <groupId>org.apache.hive</groupId>
  <artifactId>hive-metastore</artifactId>
  <version>${hive.version}</version>
  <scope>provided</scope>
  <exclusions>
    <exclusion>
      <artifactId>hadoop-auth</artifactId>
      <groupId>org.apache.hadoop</groupId>
      <groupId>org.apache.hive</groupId>
Hive's transitive dependency management is messy; the exclusion list is borrowed from Apache Flink's pom.xml.
@Reamer it's ready for review, please take a look when you have time |
<hadoop3.2.version>3.2.4</hadoop3.2.version>
<hadoop3.3.version>3.3.6</hadoop3.3.version>
<hadoop.version>${hadoop2.7.version}</hadoop.version>
<hadoop.version>${hadoop3.3.version}</hadoop.version>
Hadoop 3.2 is EOL; from the community's perspective, we'd better choose a supported version.
https://lists.apache.org/thread/ybygrhvqok0f44s7yzm6c24n0b0s727s
Another reason is that we use hadoop-client-minicluster to launch a mini YARN cluster for testing in some modules, and since Spark 3.2.0 the pre-built Spark tgz ships Hadoop 3.3.x jars. Hadoop requires that the client version be <= the server version, so we must use Hadoop 3.3 to launch the mini YARN cluster for Spark to work properly.
I really like this PR. It removes a lot of unnecessary legacy from the Zeppelin code. I have left a few comments that I would like to have clarified before the merge.
The Python 3.8 test failure should be addressed in #4748 |
@Reamer all failed tests are known flaky tests, this patch should be good to go :) |
I will merge the pull request on Wednesday as long as no further comments are received. |
Could this change break the build? I tried to build and got an error.
@Armadik mind providing reproduction steps? e.g. build command, start command, OS platform, JDK version, etc.
I see an error when running the zeppelin.sh script. Ubuntu 22.04.4 LTS:

apt install -y curl git maven openjdk-11-jdk npm libfontconfig r-base-dev r-cran-evaluate
sudo tar -zxf apache-maven-3.6.3-bin.tar.gz -C /usr/local/
sudo ln -s /usr/local/apache-maven-3.6.3/bin/mvn /usr/local/bin/mvn
cd Documents/
git clone https://github.com/apache/zeppelin.git
cd zeppelin/
export MAVEN_OPTS="-Xms1024M -Xmx4096M -XX:MaxMetaspaceSize=1024m -XX:-UseGCOverheadLimit -Dorg.slf4j.simpleLogger.log.org.apache.maven.cli.transfer.Slf4jMavenTransferListener=war"
./mvnw -B package -DskipTests -Pbuild-distr -Pspark-3.3 -Pinclude-hadoop -Phadoop3 -Pspark-scala-2.12 -Pweb-angular -Pweb-dist -pl '!groovy,!submarine,!flink,!cassandra,!jdbc,!bigquery,!alluxio,!mongodb,!neo4j' -am --no-transfer-progress
@Armadik sorry, I cannot reproduce it; both the classic and the new UI work fine on my side.
I tried a clean build. It seems the problem was in my environment.
What is this PR for?
This PR is based on #4674.
This PR aligns the Hadoop dependency version (3.3.6) across all Zeppelin modules. Additionally, it switches to the Hadoop shaded client (introduced in HADOOP-11804, available since Hadoop 3.0.0), which is the approach recommended by Hadoop; Spark also switched to the shaded client in SPARK-33212 (3.2.0). This dramatically reduces the Hadoop dependency management effort: basically, only 3 Hadoop jars are required now:
hadoop-client-api
hadoop-client-runtime
hadoop-client-minicluster (only for testing)

There is a JDK 10 compatibility issue fixed in HIVE-21508 (Hive 2.3.7), so this PR also upgrades Flink's Hive version from 2.3.4 to 2.3.7. Hive 2.3.7 is chosen because Flink supports Hive 2.3.8 and 2.3.9 only since FLINK-26739 (1.16.0), while Zeppelin supports Flink 1.15~1.17.
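As a sketch, a downstream module consuming the shaded client would declare just these three artifacts (the scopes shown here are illustrative, not Zeppelin's exact configuration):

```xml
<!-- Compile against the shaded API, ship the shaded runtime,
     and use the minicluster only in tests. -->
<dependency>
  <groupId>org.apache.hadoop</groupId>
  <artifactId>hadoop-client-api</artifactId>
  <version>3.3.6</version>
</dependency>
<dependency>
  <groupId>org.apache.hadoop</groupId>
  <artifactId>hadoop-client-runtime</artifactId>
  <version>3.3.6</version>
  <scope>runtime</scope>
</dependency>
<dependency>
  <groupId>org.apache.hadoop</groupId>
  <artifactId>hadoop-client-minicluster</artifactId>
  <version>3.3.6</version>
  <scope>test</scope>
</dependency>
```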
What type of PR is it?
Improvement
What is the Jira issue?
ZEPPELIN-5969
How should this be tested?
The test cases are modified correspondingly, and all GHA tests pass.
Screenshots (if appropriate)
Questions: