Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] "don't know how to skip type Set" error #118

Closed
cetindogu opened this issue Aug 26, 2024 · 5 comments
Closed

[BUG] "don't know how to skip type Set" error #118

cetindogu opened this issue Aug 26, 2024 · 5 comments
Labels
duplicate This issue or pull request already exists parquet-dotnet-bug

Comments

@cetindogu
Copy link

i cant view a parquet file.
line below throws error -> "don't know how to skip type Set"
var parquetReader = await ParquetReader.CreateAsync(parquetFilePath, null, cancellationToken);

The error message "don't know how to skip type Set" in Parquet typically indicates that the Parquet reader or writer you're using does not support the Set data type directly. Parquet, which is a columnar storage format, has a limited set of primitive data types and some complex types (like arrays, maps, and structs), but it doesn't directly support Java specific types like Set. !!!!

@cetindogu cetindogu added the bug Something isn't working label Aug 26, 2024
@cetindogu
Copy link
Author

there are missing compactTypes

namespace Parquet.Meta.Proto {
enum CompactType : byte {
Stop = 0x00,
BooleanTrue = 0x01,
BooleanFalse = 0x02,
Byte = 0x03,
I16 = 0x04,
I32 = 0x05,
I64 = 0x06,
Double = 0x07,
Binary = 0x08,
List = 0x09,
Set = 0x0A,
Map = 0x0B,
Struct = 0x0C,
Uuid = 0x0D
}
}

1: STOP
2: TRUE
3: FALSE
4: BYTE
5: I16 (16-bit integer)
6: I32 (32-bit integer)
7: I64 (64-bit integer)
8: DOUBLE (64-bit floating point)
9: BINARY (String or binary data)
10: LIST (List collection)
11: SET (Set collection)
12: MAP (Map collection)
13: STRUCT (Structured object)
14: VOID
15: FIXED_LENGTH_BINARY

@miaocb
Copy link

miaocb commented Oct 23, 2024

The same problem, expect to be resolved.

@mukunku
Copy link
Owner

mukunku commented Dec 22, 2024

Any chance y'all could share a sample file? You can upload it directly here by zipping it.

I also see you opened an issue in the Parquet.net repo as well: aloneguid/parquet-dotnet#544 I'll see if I can recreate the test file you mentioned but no promises.

@cetindogu
Copy link
Author

no, i can't. it is a business file.

@mukunku
Copy link
Owner

mukunku commented Jan 9, 2025

This is an issue with the parquet-dotnet library we use. I opened an issue ticket here: aloneguid/parquet-dotnet#583

Going to close this out as a duplicate of #121 . Since there's no sample file in this thread, we can continue discussions there.

@mukunku mukunku added parquet-dotnet-bug duplicate This issue or pull request already exists and removed bug Something isn't working labels Jan 9, 2025
@mukunku mukunku closed this as completed Jan 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
duplicate This issue or pull request already exists parquet-dotnet-bug
Projects
None yet
Development

No branches or pull requests

3 participants