Paragraph Splitter¶
Type¶
transform
Class¶
fire.nodes.etl.NodeParagraphSplitter
Fields¶
Name |
Title |
Description |
|---|---|---|
filePath |
Path |
Path for text file to be parsed |
outputCol |
Output Column |
Name for output column |
dropTopRows |
Drop First N rows |
Drop Top N Rows |
dropBottomRows |
Drop Last N rows |
Drop Bottom N Rows |
Details¶
Paragraph Splitter Node¶
Overview:¶
The Paragraph Splitter node splits text data into paragraphs based on specified delimiters. It’s particularly useful for processing text data like articles, blog posts, or emails.
Input:¶
Input Column: The column containing the text data to be split.
Paragraph Separator: The delimiter used to separate paragraphs (e.g., “\n\n”).
Drop First N Rows: The number of rows to skip from the beginning.
Drop Last N Rows: The number of rows to skip from the end.
Output:¶
The node outputs a new column containing the split paragraphs.
Examples¶
Example:
Let’s assume we have a column named “text” containing the following text:
This is the first paragraph.
This is the second paragraph.
This is the third paragraph.
Configure the Node:
Input Column: text
Paragraph Separator: “\n\n”
Node Execution:
The node will split the text into three paragraphs and create a new column containing the split paragraphs.