Min Max Scaler¶
MinMaxScaler transforms a dataset of Vector rows, rescaling each feature to a specific range (often [0, 1])
Input¶
It takes in a DataFrame as input and transforms it to another DataFrame
Output¶
A new column containing the scaled features is added to the incoming DataFrame
Type¶
ml-estimator
Class¶
fire.nodes.etl.NodeMinMaxScaler
Fields¶
Name |
Title |
Description |
|---|---|---|
inputCol |
Input Column |
The input column name |
outputCol |
Output Column |
The output column name |
max |
Max |
The upper bound after transformation, shared by all features |
min |
Min |
The lower bound after transformation, shared by all features |
Details¶
Min Max Scaler Transform Node Details¶
The Min Max Scaler Transform Node is used to rescale a dataset of Vector rows by transforming each feature to a specific range, often [0, 1]. It takes no additional parameter other than the input DataFrame and the output column name.
It rescales each feature individually to a specific range by dividing through the largest minimum-maximum value in each feature and it adds a new column to the DataFrame with the scaled feature values.
Input Parameters¶
INPUT DATAFRAME: The dataframe containing the feature column to be transformed.
OUTPUT COLUMN : The name of the output column after rescaling.
Examples¶
Min Max Scaler Transform Node Example¶
Consider the below Min Max Scaler output for the features column
id features scaled_features
0 [1.0, 2.0, 3.0, 4.0] [0.0, 0.33, 0.67, 1.0]
1 [-1.0, -2.0, -3.0, -4.0] [0.0, 0.33, 0.67, 1.0]
In this example, the input column is features and the output column is scaled_features. The minmax scaler scales the features individually to the range [0,1],by dividing through the largest minimum-maximum value in each feature. Here the minimum value is -4.0 and the maximum value is 4.0 so the features are divided by (4.0 - (-4.0)) = 8.0.