Run HIVEQL

This node runs the given HiveQL query on the incoming DataFrame.

Input

This node takes a DataFrame as input and transforms it into another DataFrame.

Output

This node runs the given HiveQL query on the incoming DataFrame to generate the output DataFrame.

Type

transform

Class

fire.nodes.etl.NodeRunHiveQL

Fields

Name | Title | Description
hql | HiveQL - HIVE Query Language | HiveQL

Details

Run HIVEQL Details

This node runs the given HiveQL query on the incoming DataFrame to transform it into another DataFrame. HiveQL is the SQL-like query language used to process and manipulate data stored in Apache Hive.

The input to this node is a DataFrame and the output is another DataFrame generated by running the given HiveQL on the input DataFrame.

Examples

Run HIVEQL Examples

Calculate Average Age of People by Gender

SELECT gender, AVG(age) as avg_age FROM df GROUP BY gender;

This HiveQL statement calculates the average age of people grouped by gender. In the query, df refers to the incoming DataFrame. The result is a new DataFrame with two columns, ‘gender’ and ‘avg_age’, and one row for each distinct gender value.
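To see the shape of the result, the same query can be tried on a few made-up rows. The sketch below uses Python's built-in sqlite3 module as a stand-in for Hive. This is an assumption made purely for illustration, since the node itself runs HiveQL on a DataFrame. The sample rows and the helper name average_age_by_gender are hypothetical.

```python
import sqlite3

# Hypothetical sample rows standing in for the incoming DataFrame.
SAMPLE_ROWS = [
    ("F", 30),
    ("F", 40),
    ("M", 20),
    ("M", 50),
    ("M", 38),
]

# The query from the example above, without the trailing semicolon.
QUERY = "SELECT gender, AVG(age) AS avg_age FROM df GROUP BY gender"


def average_age_by_gender(rows):
    """Run the example query on an in-memory table named df.

    SQLite is used here only as a stand-in engine; it is not Hive.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE df (gender TEXT, age INTEGER)")
        conn.executemany("INSERT INTO df (gender, age) VALUES (?, ?)", rows)
        # Sort in Python so the row order does not depend on the engine.
        return sorted(conn.execute(QUERY).fetchall())
    finally:
        conn.close()


if __name__ == "__main__":
    for gender, avg_age in average_age_by_gender(SAMPLE_ROWS):
        print(gender, avg_age)
```

With these sample rows, the two F ages average to 35.0 and the three M ages average to 36.0, so the result has one row per gender with the columns gender and avg_age.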