Dataset Structured¶
This node creates a DataFrame by reading data from a source such as HDFS or Hive. The dataset must already be defined in Fire through the Dataset feature, so as a user you only have to select the dataset you are interested in.
Input¶
It reads data from Hive or from files in HDFS.
Output¶
It creates a DataFrame from the input data and sends it to its output.
Type¶
dataset
Class¶
fire.nodes.dataset.NodeDatasetStructured
Fields¶
| Name | Title | Description |
|---|---|---|
| dataset | Dataset | Selected Dataset |
Details¶
This node creates a DataFrame by reading data from a source such as HDFS or Hive.
The dataset is defined ahead of time in Fire through the Dataset feature. As a user, you only have to select the dataset you are interested in.
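The select-by-name behavior described above can be pictured with a small, self-contained Python sketch. It is a conceptual illustration only, not Fire's implementation: `DatasetDefinition`, `REGISTRY`, and `read_selected_dataset` are hypothetical names, inline CSV text stands in for an HDFS file or Hive table, and a list of dictionaries stands in for the DataFrame.

```python
import csv
import io
from dataclasses import dataclass


# Hypothetical stand-in for a dataset definition saved ahead of time:
# a name, the column names and types, and where the data lives.
@dataclass
class DatasetDefinition:
    name: str
    columns: list  # column names, in file order
    types: list    # one Python type per column
    source: str    # inline CSV text standing in for an HDFS path or Hive table


# Hypothetical registry of previously defined datasets, keyed by name.
REGISTRY = {
    "customers": DatasetDefinition(
        name="customers",
        columns=["id", "name", "balance"],
        types=[int, str, float],
        source="1,Ana,10.5\n2,Ben,0.0\n",
    ),
    "orders": DatasetDefinition(
        name="orders",
        columns=["order_id", "customer_id"],
        types=[int, int],
        source="100,1\n101,2\n102,1\n",
    ),
}


def read_selected_dataset(dataset_name):
    """Look up a defined dataset by name and return its rows as typed dicts."""
    if dataset_name not in REGISTRY:
        raise KeyError(f"No dataset named {dataset_name!r} has been defined")
    definition = REGISTRY[dataset_name]
    rows = []
    for raw in csv.reader(io.StringIO(definition.source)):
        # Apply the stored schema: cast each field to its declared type.
        typed = [cast(value) for cast, value in zip(definition.types, raw)]
        rows.append(dict(zip(definition.columns, typed)))
    return rows


# The only value the user supplies is the name of the dataset to read.
table = read_selected_dataset("customers")
print(table)
```

The point of the sketch is the division of labor: the schema and location are recorded once when the dataset is defined, so the node itself needs only the single `dataset` field listed in the table above.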
Examples¶
Dataset Structured Node Example¶
Scenario:
Suppose you have multiple datasets available in your workflow and you want to use one of them as the input to the next node. You can use the Dataset Structured node to select that dataset.
Configuration:
Output Storage Level: Select the desired storage level for the output DataFrame.
Dataset: Choose the dataset from the dropdown list.
Output:
The node will output the selected dataset as a DataFrame.