Open NLP Document Categorizer¶
This node classifies text into pre-defined categories using OpenNLP - https://opennlp.apache.org/documentation/1.7.2/manual/opennlp.html#tools.doccat. It takes in the OpenNLP model. Models can be downloaded from http://opennlp.sourceforge.net/models-1.5/
Input¶
It takes in a DataFrame as input.
Output¶
It finds the Document Category and stores the result in the specified output column.
Type¶
transform
Class¶
fire.nodes.opennlp.NodeOpenNLPDocumentCategorizer
Fields¶
Name |
Title |
Description |
|---|---|---|
model |
Model |
Path to the model file (on HDFS when running on the cluster) |
inputCol |
Input Text Column |
input cpulmn name |
outputCol |
Output Column |
Output Column containing the results |
Details¶
This node classifies text into pre-defined categories using OpenNLP
https://opennlp.apache.org/documentation/1.7.2/manual/opennlp.html#tools.doccat.
It takes in the OpenNLP model. Models can be downloaded from http://opennlp.sourceforge.net/models-1.5/