Home
mukunku edited this page Jun 19, 2018
·
30 revisions
- Go to File → Open (CTRL + O)
- Select the Parquet File you'd like to open
- Once file analysis is done, select the fields you would like loaded.
- The fewer fields you select, the faster the data will load
- File analysis can take several minutes for large Parquet files (multiple GB)
- By default the first 1000 records will be displayed in the UI
- This is because records need to be loaded into memory in order to be displayed
- This can be adjusted using the settings in the top-right corner
Changed your mind and want to load a different set of fields from the open parquet file?
- Go to Edit → Add/Remove Fields (CTRL + F)
This will let you skip the File Analysis step
By default the first 1000 records are loaded and displayed within the utility.
To look at the second 1000 records, for example, you must increase the Record Offset option.
This option is located in the top-right portion of the UI.
- Let's say we have a parquet file with 28000 records.
- By default the utility will load the first 1000 records (Record Offset: 0, Record Count: 1000)
- This can be seen in the bottom right of the status bar.
- To look at the next 1000 records, increase the Record Offset by 1000 (Record Offset: 1000, Record Count: 1000)
- The utility will automatically load the next 1000 records
- This can be confirmed by looking at the status bar.
You can increase the record count to view a larger window of records.
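
The relationship between Record Offset and Record Count can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the utility's own code: the function names `record_window` and `page_offsets` are hypothetical, and zero-based record indices are an assumption.

```python
def record_window(total_records, record_offset=0, record_count=1000):
    """Return (first, last) zero-based indices of the records in the window.

    Hypothetical helper for illustration; returns None when the window is empty.
    """
    if total_records < 0 or record_offset < 0 or record_count < 0:
        raise ValueError("arguments must be non-negative")
    start = min(record_offset, total_records)
    end = min(record_offset + record_count, total_records)  # clamp to file size
    if start >= end:
        return None
    return (start, end - 1)


def page_offsets(total_records, record_count=1000):
    """List the Record Offset values needed to step through the whole file."""
    if record_count <= 0:
        raise ValueError("record_count must be positive")
    return list(range(0, total_records, record_count))


if __name__ == "__main__":
    total = 28000  # the example file size used above
    print(record_window(total))                    # default window
    print(record_window(total, record_offset=1000))  # next 1000 records
    print(len(page_offsets(total)))                # number of 1000-record pages
```

Under these assumptions, stepping the offset by the record count walks through a 28000-record file in 28 windows, and a larger record count simply widens each window.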