Read JDBC

This node reads data from Relational Databases using JDBC and creates a DataFrame from it

Input

It reads data from Relational Databases

Output

It creates a DataFrame from the data read and sends it to its output

Type

dataset

Class

fire.nodes.dataset.NodeDatasetJDBCUsingConnection

Fields

Name

Title

Description

connection

Connection

The JDBC connection to connect

jdbcDatabase

Database

Database for connecting to the JDBC

jdbctable

Table

JDBC Table from which to read the data

schema

InferSchema

outputColNames

Column Names of the Table

Output Columns Names of the Table

outputColTypes

Column Types of the Table

Output Column Types of the Table

outputColFormats

Column Formats

Output Column Formats

performance

Performance

fetchsize

Fetch Size

The JDBC fetch size, which determines how many rows to fetch per round trip. This can help performance on JDBC drivers which default to low fetch size (e.g. Oracle with 10 rows).

queryTimeout

Query Timeout

The number of seconds the driver will wait for a Statement object to execute. Zero means there is no limit.

sessionInitStatement

Session Init Statement

After each database session is opened to the remote DB and before starting to read data, this parameter executes a custom SQL statement (or a PL/SQL block). Use this to implement session initialization code. Example: option(“sessionInitStatement”, “BEGIN execute immediate ‘alter session set “_serial_direct_read”=true’; END;”).

partitionColumn

Partition Column

PartitionColumn must be a numeric, date, or timestamp column from the table

lowerBound

Lower Bound

LowerBound and UpperBound are just used to decide the partition stride, not for filtering the rows in the table. All rows in the table will be partitioned and returned. This option applies only to reading

upperBound

Upper Bound

LowerBound and UpperBound are just used to decide the partition stride, not for filtering the rows in the table. All rows in the table will be partitioned and returned. This option applies only to reading

numPartitions

Num Partitions

The maximum number of partitions that can be used for parallelism in table reading

properties

Properties

extraOptionsKeys

Properties Name

Extra options/properites available while executing in Read JDBC.

extraOptionsValues

Properties Value

Config Values for the Corresponding properites name

Details

Read JDBC Node Details

This node reads data from Relational Databases using JDBC and creates a DataFrame from it.

Parameters to be set:

General:

  • OUTPUT STORAGE LEVEL: Keep this as DEFAULT.

  • CONNECTION: Select the desired JDBC connection to be used.

  • DB TABLE: Specify the table from which data is to be read.

Performance:

  • PARTITION COLUMN: The column used to partition the table data for parallel reads.

  • LOWER BOUND: The minimum value for the partition column to start data partitioning.

  • UPPER BOUND: The maximum value for the partition column to end data partitioning.

  • NUM PARTITIONS: Number of partitions for parallel data processing.

  • FETCH SIZE: Number of rows to fetch per call; helps optimize data retrieval performance.

Schema:

  • Schema Columns: Refresh the schema for loading column name,column type,column format.