Run HIVEQL
===========

This node runs the given SQL on the incoming DataFrame.

Input
--------------

This node takes in a DataFrame as input and transforms it to another DataFrame.

Output
--------------

This node runs the given SQL on the incoming DataFrame to generate the output DataFrame.

Type
---------

transform

Class
---------

fire.nodes.etl.NodeRunHiveQL

Fields
---------

.. list-table::
   :widths: 10 5 10
   :header-rows: 1

   * - Name
     - Title
     - Description
   * - hql
     - HiveQL
     - Hive Query Language (HiveQL) statement to run

Details
-------

Run HIVEQL Details
++++++++++++++++++

This node runs the given HiveQL on the incoming DataFrame to transform it into another DataFrame.

HiveQL is a query language used to process and manipulate data stored in Apache Hive.

The input to this node is a DataFrame. The output is another DataFrame, generated by running the given HiveQL on the input DataFrame.

Examples
--------

Run HIVEQL Examples
+++++++++++++++++++

Calculate Average Age of People by Gender
+++++++++++++++++++++++++++++++++++++++++

::

   SELECT gender, AVG(age) AS avg_age FROM df GROUP BY gender;

This HiveQL statement refers to the incoming DataFrame as ``df`` and calculates the average age of people grouped by gender. The result is a new DataFrame with two columns, ``gender`` and ``avg_age``.
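
To see the shape of the result outside the node, the same statement can be tried on a few sample rows. The following is only a minimal sketch: it uses Python's built-in ``sqlite3`` module as a stand-in for the Hive engine, and the sample rows are hypothetical. The table is named ``df`` only to match the query above; sorting the rows afterwards is an addition for stable output, since ``GROUP BY`` does not guarantee an order.

```python
import sqlite3

# Hypothetical sample rows standing in for the incoming DataFrame.
rows = [
    ("F", 30),
    ("F", 40),
    ("M", 20),
    ("M", 30),
    ("M", 40),
]

# Assumption: an in-memory SQLite table named "df" plays the role of the
# incoming DataFrame. The node itself runs HiveQL, not SQLite.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE df (gender TEXT, age INTEGER)")
conn.executemany("INSERT INTO df VALUES (?, ?)", rows)

# Same statement as in the example above.
query = "SELECT gender, AVG(age) AS avg_age FROM df GROUP BY gender"

# Sort in Python so the printed order is deterministic.
result = sorted(conn.execute(query).fetchall())
conn.close()

for gender, avg_age in result:
    print(gender, avg_age)
```

With these sample rows the output has one row per distinct ``gender`` value and two columns, matching the ``gender`` and ``avg_age`` columns described above.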