You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Once you have the later version of the 6 JARs in the bin folder restart the spark-shell and re-create the file using the same code above.
This new file won't be openable with Parquet.NET
Failing test
//Following code will fail with provided test filesusingStreams=System.IO.File.OpenRead(@"C:\Users\Sal\source\repos\parquet-dotnet\src\Parquet.Test\data\parquet-mr-1.14.3.parquet");usingParquetReaderr=awaitParquetReader.CreateAsync(s);
The text was updated successfully, but these errors were encountered:
Library Version
5.0.2
OS
Windows
OS Architecture
64 bit
How to reproduce?
Summary
Parquet files created in Apache Spark using Parquet-Java (formerly parquet-mr) 1.14.0+ can't be opened.
Sample files: Samples.zip
System.InvalidOperationException: don't know how to skip type 14
ystem.InvalidOperationException: don't know how to skip type Double
All provided files are openable using https://www.parquet-viewer.com/ so they appear valid.
How do I generate these files myself?
Below are the steps I followed on my ARM Macbook to generate these files:
./bin/spark-shell
sample_parquet_file.parquet
in the directory.bin
folder:bin
folder for Spark to usebin
folder restart thespark-shell
and re-create the file using the same code above.Failing test
The text was updated successfully, but these errors were encountered: