Remove Unwanted Characters Advanced

This node removes unwanted characters (whitespace, letters, digits, signs, and commas) from the selected input columns.

Input

It accepts a DataFrame as input from the previous node.

Type

transform

Class

fire.nodes.etl.NodeRemoveUnwantedCharactersMultiple

Fields

 Name               |  Title               |  Description
-------------------------------------------------------------
 inputCols          |  Input Columns       |  Input columns
 removeWhitespaces  |  Remove Whitespaces  |  Removes white space
 removeLetters      |  Remove Letters      |  Removes letters
 removeDigits       |  Remove Digits       |  Removes digits
 removeSigns        |  Remove Signs        |  Removes signs
 removeCommas       |  Remove Commas       |  Removes commas

Details

This node removes the selected categories of unwanted characters from the specified input columns. Columns that are not selected pass through unchanged.

Examples

The incoming DataFrame has the following row:

 CUST_CD  |  CUST_NAME  |  SALARY    |    COMMENTS
-------------------------------------------------------------
 C01      |  MIKE       | $12,500.00 |  Salary hiked by 10%

If the node is configured to remove [SIGNS] and [COMMAS] from the [SALARY] column, and [DIGITS] and [SIGNS] from the [COMMENTS] column,

then the outgoing DataFrame would be as below:

 CUST_CD  |  CUST_NAME  |  SALARY    |    COMMENTS
-------------------------------------------------------------
 C01      |  MIKE       | 1250000    |  Salary hiked by

The examples below show how the same row is processed for each option when it is selected on its own.

REMOVE WHITESPACES is selected as [True]

Whitespace is removed from the selected columns [SALARY] and [COMMENTS]

 CUST_CD  |  CUST_NAME  |  SALARY    |    COMMENTS
-------------------------------------------------------------
 C01      |  MIKE       | $12,500.00 |  Salaryhikedby10%

REMOVE LETTERS is selected as [True]

Letters are removed from the selected columns [SALARY] and [COMMENTS]

 CUST_CD  |  CUST_NAME  |  SALARY    |    COMMENTS
-------------------------------------------------------------
 C01      |  MIKE       | $12,500.00 |  10%

REMOVE DIGITS is selected as [True]

Digits are removed from the selected columns [SALARY] and [COMMENTS]

 CUST_CD  |  CUST_NAME  |  SALARY    |    COMMENTS
-------------------------------------------------------------
 C01      |  MIKE       | $,.        |  Salary hiked by %

REMOVE SIGNS is selected as [True]

Special characters are removed from the selected columns [SALARY] and [COMMENTS]

 CUST_CD  |  CUST_NAME  |  SALARY    |    COMMENTS
-------------------------------------------------------------
 C01      |  MIKE       | 1250000    |  Salary hiked by 10

REMOVE COMMAS is selected as [True]

Commas are removed from the selected columns [SALARY] and [COMMENTS]

 CUST_CD  |  CUST_NAME  |  SALARY    |    COMMENTS
-------------------------------------------------------------
 C01      |  MIKE       | $12500.00  |  Salary hiked by 10%
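
The per-option behaviour shown above can be approximated outside the node with plain regular expressions. The sketch below is illustrative only and is not the node's implementation: the function name remove_unwanted, the option keywords, and the character classes are all assumptions. In particular, it assumes a "sign" is any character that is not a letter, digit, or whitespace, which matches the REMOVE SIGNS example above, where the comma in [SALARY] is also removed.

```python
import re

# Assumed character classes for each option; the node's real rules may differ.
PATTERNS = {
    "whitespaces": r"\s",
    "letters": r"[A-Za-z]",
    "digits": r"[0-9]",
    "commas": r",",
    # Assumption: a "sign" is anything that is not a letter, digit, or whitespace.
    "signs": r"[^A-Za-z0-9\s]",
}


def remove_unwanted(value, **options):
    """Return value with every enabled character category removed."""
    selected = [PATTERNS[name] for name, enabled in options.items() if enabled]
    if not selected:
        return value  # nothing selected, so the value is passed through
    return re.sub("|".join(selected), "", value)


salary = "$12,500.00"
comments = "Salary hiked by 10%"

# Combined example: signs and commas from SALARY, digits and signs from COMMENTS.
print(remove_unwanted(salary, signs=True, commas=True))
print(remove_unwanted(comments, digits=True, signs=True))

# Single-option examples.
print(remove_unwanted(comments, whitespaces=True))
print(remove_unwanted(salary, digits=True))
print(remove_unwanted(salary, commas=True))
```

Removing letters or digits leaves the surrounding spaces in place, so a value such as the [COMMENTS] entry may keep leading or trailing whitespace unless the whitespace option is selected as well.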