Split¶
This node splits the incoming DataFrame into 2. It takes in the fraction to use in splitting the data. For example, if the fraction is .7, it would split the data into 2 DataFrames, one containing 70% of the rows(passed from lower edge id to next node) and the other containing the remaining 30%(passed from higher edge id to next node).
Input¶
It takes in a DataFrame as input
Output¶
The input DataFrame is split into 2 DataFrames and output
Type¶
transform
Class¶
fire.nodes.ml.NodeSplit
Fields¶
Name |
Title |
Description |
|---|---|---|
fraction1 |
Fraction 1 |
Fraction to be used for Splitting the DataFrame into two. The first DataFrame would go to the lower edge output. The other would go to the higher edge output. |
seed |
Seed |
Random number for Reproducibility |
Details¶
This node splits the incoming DataFrame into 2. It takes in the fraction to use in splitting the data.
For example, if the fraction is .7, it would split the data into 2 DataFrames, one containing 70% of the rows(passed from lower edge id to next node) and the other containing the remaining 30%(passed from higher edge id to next node).
The split node can be used for splitting the DataFrame for training and test datasets used in Machine Learning.
Examples¶
lets say you pass a table with 5 rows and the fraction .6(60%),the first and second outcomes of the node will give dataframes as follows
First
PartID SupplierID PartName
StringType StringType StringType
P9271 S798 Part_D
P523 S955 Part_K
P3201 S332 Part_M
Second
PartID SupplierID PartName
StringType StringType StringType
P9634 S527 Part_G
P9345 S850 Part_M