Read Parquet¶
Dataset Node for reading Apache Parquet Files
Input¶
It reads in Parquet files
Output¶
It creates a DataFrame from the data read and sends it to its output
Type¶
dataset
Class¶
fire.nodes.dataset.NodeDatasetParquet
Fields¶
Name |
Title |
Description |
|---|---|---|
path |
Path |
Path of the Parquet file/directory |
addInputFileName |
Add Input File Name |
Add the new field:input_file_name |
schema |
InferSchema |
|
outputColNames |
Column Names for the Parquet |
Output Columns of the Parquet |
outputColTypes |
Column Types for the Parquet |
Data Type of the Output Columns |
outputColFormats |
Column Formats for the Parquet |
Format of the Output Columns |
Details¶
Read Parquet Node Details¶
This node reads a Parquet file and creates the DataFrame which contains the schema and data of the specified Parquet file.
Parameters to be set:¶
OUTPUT STORAGE LEVEL : Keep this as DEFAULT.
PATH : Specify the path of the Parquet file to be read.
ADD INPUT FILE NAME : Select if the Parquet file name needs to be added to the DataFrame.
SCHEMA COLUMNS : Refresh the schema of the DataFrame.