Sklearn Random Forest Regression¶

Random Forest Regression, fits a number of classifying decision trees on various sub-samples of the dataset and uses averaging to improve the predictive accuracy and control over-fitting. The sub-sample size is controlled with the max_samples parameter if bootstrap=True (default), otherwise the whole dataset is used to build each tree.

Type¶

ml-estimator

Class¶

fire.nodes.sklearn.NodeSklearnRandomForestRegression

Fields¶

Name	Title	Description
targetCol	Target Column	The label column for model fitting
featureCols	Feature Columns	Feature columns of type - all numeric, boolean and vector
splitRatio	Split Ratio	Split Ratio
n_estimators	Number of Estimators	Specifies the number of trees in the forest.
criterion	Criterion	The function to measure the quality of a split.
max_depth	Max Depth	The maximum depth of the tree. If not set, nodes are expanded until all leaves are pure or until all leaves contain less than min_samples_split samples.
min_samples_split	Min Samples Split	The minimum number of samples required to split an internal node. Higher values prevent a tree from growing too complex.
min_samples_leaf	Min Samples Leaf	The minimum number of samples required to be at a leaf node.
min_weight_fraction_leaf	Min Weight Fraction Leaf	The minimum weighted fraction of the sum total of weights required to be at a leaf node.
max_features	Max Features	The number of features to consider when looking for the best split.
max_leaf_nodes	Max Leaf Nodes	Grow trees with max_leaf_nodes in best-first fashion. If not set, then unlimited number of leaf nodes.
min_impurity_decrease	Min Impurity Decrease	A threshold for early stopping in tree growth. A node will split if its impurity is above the threshold, otherwise it is a leaf.
bootstrap	Bootstrap	Whether bootstrap samples are used when building trees.
oob_score	Oob Score	Whether to use out-of-bag samples to estimate the R^2 on unseen data.
random_state	Random State	Controls the randomness of the bootstrapping of the samples used when building trees (if bootstrap=True). Default value is None
warm_start	Warm Start	When warm_start is true, the existing fitted model attributes an are used to initialise the new model in a subsequent call to fit.

Details¶

More details are available at : https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestRegressor.html