Tika

Reads in files from a given path and parses them with Apache Tika

Type

dataset

Class

fire.nodes.dataset.NodeDatasetTika

Fields

Name

Title

Description

path

Path

Path of the file/directory

fileNameCol

File Name Column

File Name Column in the Output DataFrame

contentCol

Content Column

Tika output column in the Output DataFrame