Ozone ofs in presto #7259
Replies: 10 comments 28 replies
-
Yes, ofs is a Hadoop compatible protocol and should be able to be used by Presto. Our Ozone cluster is currently being used by Presto to query Hive tables on Ozone (instead of HDFS). |
Beta Was this translation helpful? Give feedback.
-
Hi @ivandika3 Thanks for the quick response. I tried the OFS protocol by adding the jar
|
Beta Was this translation helpful? Give feedback.
-
No, the client is running in my local. As you mentioned the above PR is using S3G. Is there any idea how this internal IP is getting to the client. I am not aware of the internal communication inside the ozone. I guess it will work If I managed to supply the public IP instead of internal IP to the client. But not sure how it can be done. |
Beta Was this translation helpful? Give feedback.
-
Sure @ivandika3 i will try it out. |
Beta Was this translation helpful? Give feedback.
-
This is not only the issue from presto. I followed the below documentation to try locally using hadoop I can create volume and schema using
|
Beta Was this translation helpful? Give feedback.
-
Does the client directly interact with data nodes both during read and write? |
Beta Was this translation helpful? Give feedback.
-
@ivandika3 - I tried presto and ozone in the same machine. This is working with hive catalog in presto. But I tried the remote connectivity to an ozone instance in cloudera. It throws the same error I posted in the previous thread. The client is trying to access the internal IP of the datanode which is in the cloudera VM. So this is not only with the docker setup. Is there a way I can specify datanode host at the client side using ozone-site.xml or hdfs-site.xml? |
Beta Was this translation helpful? Give feedback.
-
@lukmanulhakkeem try using Docker's host network driver. This eliminates container-specific IP addresses and NAT. On the flip side, you'll need to configure different set of ports for each service with multiple instances (e.g. each datanode) to avoid port conflict. |
Beta Was this translation helpful? Give feedback.
-
@ivandika3 @adoroszlai - I cloned the repo and checkout to But I am getting the below error in presto when used the above jar But it was working fine with the below Jar from the maven repository. https://mvnrepository.com/artifact/org.apache.ozone/ozone-filesystem-hadoop3/1.4.0 Is there any change in the protobuf in ozone? |
Beta Was this translation helpful? Give feedback.
-
@ivandika3 @adoroszlai Thanks for all the support. I am just briefing what I have done here. I wanted to run queries in Presto using Apache Ozone as the storage backend with the OFS protocol. To achieve this, I added the Ozone JAR to the required modules in Presto. The connection was successful when Ozone and Presto were running on the same machine. However, it did not work when Ozone and Presto were on different machines (e.g., two separate cloud VMs). Upon investigation, I noticed that the Ozone Manager (OM) always returns the internal IP addresses of the DataNodes during read and write operations. Based on @ivandika3's guidance, I added a configuration in Ozone to use the hostnames of the DataNodes instead of their internal IPs for read and write operations. I tested the above changes on an Ozone instance in Cloudera, and the setup worked fine with Presto using this configuration. However, the issue persists when Ozone is running inside a Docker container, as the hostname of the Docker container differs from what is expected. I attempted to change the hostname of the Docker container, but it still did not resolve the problem. As a result, we are unable to remotely access Ozone storage using the OFS protocol. This issue is not limited to Presto; it can also be reproduced with the hdfs dfs command. Is it possible for the community to provide a fix for this issue in the future? |
Beta Was this translation helpful? Give feedback.
-
Hi folks, I am able to connect to ozone using the iceberg catalog from presto using the s3 gateway. But is it possible to connect to ozone from presto using the ofs protocol? Did anyone try this?
Beta Was this translation helpful? Give feedback.
All reactions