Read Parquet¶

Dataset Node for reading Apache Parquet Files

Input¶

It reads in Parquet files

It creates a DataFrame from the data read and sends it to its output

dataset

fire.nodes.dataset.NodeDatasetParquet

Name	Title	Description
path	Path	Path of the Parquet file/directory
addInputFileName	Add Input File Name	Add the new field:input_file_name
schema	InferSchema
outputColNames	Column Names for the Parquet	Output Columns of the Parquet
outputColTypes	Column Types for the Parquet	Data Type of the Output Columns
outputColFormats	Column Formats for the Parquet	Format of the Output Columns

This node reads a Parquet file and creates the DataFrame which contains the schema and data of the specified Parquet file.

OUTPUT STORAGE LEVEL : Keep this as DEFAULT.
PATH : Specify the path of the Parquet file to be read.
ADD INPUT FILE NAME : Select if the Parquet file name needs to be added to the DataFrame.
SCHEMA COLUMNS : Refresh the schema of the DataFrame.