One Hot Encoder
===========

Maps a column of label indices to a column of binary vectors, with at most a single one-value

Input
--------------
It takes in a DataFrame and transforms it to another DataFrame

Output
--------------
The output DataFrame contains a new column which contains the mapping of a column of label indices to a column of binary vectors, with at most a single one-value.

Type
--------- 

ml-transformer

Class
--------- 

fire.nodes.ml.NodeOneHotEncoder

Fields
--------- 

.. list-table::
      :widths: 10 5 10
      :header-rows: 1

      * - Name
        - Title
        - Description
      * - inputCols
        - Input Columns
        - Input columns for encoding
      * - outputCols
        - Output Columns
        - Output columns


Details
-------
One Hot Encoder Node Details
+++++++++++++++


One-hot encoding is a process by which categorical data (such as nominal data) are converted into numerical features of a dataset. This is often a required preprocessing step since machine learning models require numerical data.


One Hot Encoder Node maps a column of label indices to a column of binary vectors, with at most a single one-value.This encoding allows algorithms which expect continuous features, such as Logistic Regression, to use categorical features.


Input Parameters
+++++++++++++++


* OUTPUT STORAGE LEVEL : Keep this as DEFAULT.
* VARIABLES : Add the columns to be one-hot encoded
* Input Columns : Select the categorical field in the input schema for one-hot encoding.
* Output Columns : Set the name of the field in the output schema that contains the binary vector result.


Examples
-------
One Hot Encoder Node Example
+++++++++++++++


Assume that we have the following DataFrame with column v1 as integer data type:

::

    v1 |
    ---|
    1  |
    2  |
    3  |
    4  |
    5  |


The output dataframe that will be obtained by selecting **v1** as the Input columns and **vecOut** as the Output column name will be :


::

     v1 |     vecOut    |
    ----|---------------|
     1  | (5,[1],[1.0]) |
     2  | (5,[2],[1.0]) |
     3  | (5,[3],[1.0]) |
     4  | (5,[4],[1.0]) |
     5  | (5,[],[])     |