Skip to content
mukunku edited this page Jun 19, 2018 · 30 revisions

How to Open a Parquet File for Viewing:

  1. Go to File → Open (CTRL + O)

  1. Select the Parquet File you'd like to open

  1. Once file analysis is done, select the fields you would like loaded.
  • The lesser the amount of fields selected the faster the data will be loaded
  • File analysis might take several minutes depending on the Parquet file's size (Multiple GB's)

  1. By default the first 1000 records will be displayed in the UI
  • This is because records need to be loaded into memory in order to be displayed
  • This can be adjusted by the settings in the top-right corner

Selected Fields

Changed your mind and want to load a different set of fields from the open parquet file?

  • Go to Edit → Add/Remove Fields (CTRL + F)

This will let you skip the File Analysis step


Record Offset (Navigating Files with Many Records)

By default the first 1000 records are loaded and displayed within the utility.
In order to look at the second 1000, for example, you must increase the Record Offset option:

This option can be located at the top right portion of the UI

Example

  1. Let's say we have a parquet file with 28000 records.
  2. By default the utility will load the first 1000: Record Offset: 0 - Record Count: 1000
  3. This can be seen in the bottom right of the status bar:

  1. In order to look at the next 1000 records, you must increase the Record Offset by 1000:
  2. The utility will automatically load the next 1000 records
  3. This can be confirmed by looking at the status bar:

You can increase the record count to view a larger window of records.

Clone this wiki locally