KNN: s3_input with Pipe/CSV/GZIP not working #2361
Unanswered
moebelde-rs asked this question in Help
Replies: 1 comment
Thanks for using SageMaker! I've reached out to the relevant team to see if they have any insight.
Hi,
I got some strange results when trying to train the KNN estimator with a gzipped CSV (which should be possible according to the docs).
This is my input setup:
With this input, KNN processes only a fraction of the samples. If the CSV contains 1k samples and sample_size is set to the same value, the output is:
Here you can see that only 32 samples were processed, and the number changes on each run.
If I use uncompressed CSVs, the training works as expected.
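To rule out the archive itself, the row count of the gzipped file can be checked locally before uploading it to S3. This is only a minimal sketch using the Python standard library; it does not touch SageMaker, the helper name `count_csv_rows` is made up for illustration, and it assumes the CSV has no header row.

```python
import csv
import gzip


def count_csv_rows(path):
    """Count the rows in a gzip-compressed CSV file (hypothetical helper)."""
    # "rt" decompresses and decodes to text; newline="" is what the csv module expects.
    with gzip.open(path, "rt", newline="") as handle:
        return sum(1 for _ in csv.reader(handle))
```

Calling `count_csv_rows("train.csv.gz")` (the file name is an assumption) should return the same number that was passed as sample_size. If it does, the file is intact and the discrepancy lies in how the compressed stream is read during training.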
Thanks in advance
Rob